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H uinan N-Methvl-D-A sp^rt-at^ Receptor J^bunit^ 
Nucleic _Acjds Encoding _Sa.™* and Uses Therefor 

This application is a continuation-in-part of 
United States Serial No. 08/052,449, filed April 20, 1993, 
now pending- 

The present invention relates to nucleic acids 
and receptor proteins encoded thereby. Invention nucleic 
acids encode novel human N-methy 1-D-aspartate (NMDA) 
receptor subunits. The invention also relates to methods 
for making such receptor subunits and for using the 
receptor proteins in assays designed to identify and 
characterize compounds which affect the function of such 
receptors, e.g., agonists and antagonists of NMDA 
receptors . 

BACKGROUND OF THE INVENTIO N 

The amino acid L-glutamate is a major excitatory 
neurotransmitter in the mammalian central nervous system. 
Anatomical, biochemical and electrophysiological analyses 
suggest that glutamatergic systems are involved in a broad 
array of neuronal processes, including fast excitatory 
synaptic transmission, regulation of neurotransmitter 
releases, long-term potentiation, learning and memory, 
developmental synaptic plasticity, hypoxic-ischemic damage 
and neuronal cell death, epileptiform seizures, as well as 
the pathogenesis of several neurodegenerative disorders. 
See generally, Monaghan et al., Ann. Rev. Pharmacol 
Toxicol. 29:365-402 (1980). This extensive repertoire of 
functions, especially those related to learning, 
neurotoxicity and neuropathology, has stimulated recent 
attempts to describe and define the mechanisms through 
which glutamate exerts its effects. 

Currently, glutamate receptor classification 
schemes are based on pharmacological criteria. Glutamate 



has been observed to mediate its effects through receptors 
that have been categorized into two -in groups: 
ionotropic and metabotropic. lonotropic glutamate 

receptors contain integral cation-specific, ligand-gated 
5 ion channels, whereas metabotropic glutamate receptors are 
G-protein-coupled receptors that transduce extracellular 
signals via activation of intracellular second messenger 
systems. Ionotropic receptors are further divided into at 
least two categories based on the pharmacologxcal and 
10 functional properties of the receptors. The two main types 
of ionotropic receptors are N-methy 1-D-aspartxc acxd (NMDA) 
and *ainic acid (KA) /a — ami no -3 — hydroxy— 5— methyl— isoxazole— 
4-propionic" acid (AMPA) , formerly called the guxsgualxc 
acid, or QUIS, receptor. While the metabotropic receptors 
15 bind to some of the same ligands that bind to ionotropxc 
glutamate receptors, the metabotropic receptors alter 
synaptic physiology via GTP-binding proteins and second 
messengers such as cyclic AMP , cyclic GMP, diacy Iglycerol , 
inositol 1,4,5-triphosphate and calcium [Gundersen et al., 
20 Proc. R . Soc. London Ser. 221:127 (1984); Sladeczek et al., 
Nature 317:717 (1985); Nicoletti et al., J. Neuroscx. 
6:1905 (1986); Sugiyama et al., Nature 325:531 (1987)]. 

The electrophysiological and pharmacological 
properties of the glutamate receptors have been studied 
25 using animal tissues and cell lines, as well as 
recombinants produced non-human receptors, as the source 
of such receptors. The value of such studies for 
application to the development of human therapeutics has 
been limited by the availability of only non-human receptor 
30 subunits. Moreover, it is only recently that the 
characteristics and structure of glutamate receptors have 
been investigated at the molecular level. The majorxty of 
such investigation has, however, been carried out in non- 
human species. Because of the potential physiological and 
35 pathological significance of glutamate receptors, xt would 
be desirable (for example, for drug screening assays) to 
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have available human sequences (i.e., DNA, RNA, proterns, 
which encode representative .e.bets of the various 
qlutamate receptor subtypes. The availability of such 
human sequences will also enable the investigation of 
receptor distribution in humans, the correlate of 
specific receptor modification with the occurrence of 
various disease states, etc. 

BRIEF DESCRIPT ION ™ ™r TNVENTION 

The present invention discloses novel nucleic 
acids encoding NMDA receptor protein subunits and the 
proteins encoded thereby. In a particular embodiment the 
novel nucleic acids encode NMDAR1 and NMDAR2 subunits of 
human NMDA receptors. More specifically, the invention 
nucleic acids encode NMDAR1 , NMDAR2 A , NMDAR2B , NMDAR2C and 
15 NMDAR2D subunits that contribute to the formation of NMDA- 
activated cation-selective ion channels. In addition to 
being useful for the production of NMDA receptor subunit 
proteins, these nucleic acids are also useful as probes, 
thus enabling those skilled in the art, without undue 
20 experimentation, to identify and isolate nucleic acids 
encoding related receptor subunits. 

Functional glutamate receptors can be assembled, 
in accordance with the present invention, from a plurality 
of NMDA receptor subunit proteins of one type (homomenc) 
25 or from combinations of subunit proteins of different types 
(heteromeric) . 

in addition to disclosing novel NMDA receptor 
protein subunits, the present invention also comprises 
methods for using such receptor subunits to identify and 
30 characterize compounds which affect the function of such 
receptors, e.g., agonists, antagonists, and modulators of 
glutamate receptor function. The invention also comprises 
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4 

methods for determining whether unknown protein(s) are 
functional as NMDA receptor subunits. 

PPT ^ nr<;cRTPTION OF THE FIGURES 

Figure 1 is a schematic representation of various 

^an NMDAR1 clones of the invention, with partial 

restriction maps of each clone. The clones are aligned and 

the differences in the DNAs (i.e., deletions and 

insertions), relative to clone NMDA10 , are indicated. 

■, 4.- Vitiation and termination sites are 

Translation initiation auu 

represented by a »V» and a — , respectively. Insertions 
are marked' as inverted triangles, deletions are indicated 
by spaces in the boxes. The numbers above the insertions 
and deletions refer to the number of nucleotides inserted 
or deleted relative to NMDA10. 

Figure 2 is a schematic representation of cDNAs 
encoding full-length human NMDAR1 subunit subtypes of the 
invention, with partial restriction maps of each DNA. The 
full-length cDNAs are constructed by ligation of 
appropriate portions of the clones shown in Figure , 1 
Regions of each full-length cDNA composed of nucleotide 
sequences corresponding to a particular clone are 
distinguished as solid, striped, cross-hatched or open 
boxes . 

Figure 3 presents the entire nucleotide sequence 
of construct NMDAR1A (see Sequence ID No. 1) with the 
following information added for ease of comparison of the 
splice variations of the NMDAR1 subunit transcript: 
lowercase letters indicate 5' untranslated sequence and the 
3- untranslated sequence of the NMDAR1 splice variant shown 
in Sequence ID No. 1 (in some of the other splice variants, 
this 3' untranslated sequence is actually coding sequence) ; 
uppercase letters indicate coding sequence; the translation 
initiation codon is identified by the word " START" whereas 



the three different translation termination codons (TGA) 
used in the different splice variants are identified by 
small boxes; significant restriction enzyme sites used m 
preparing full-length variant constructs are identified by 
name above the sites; the location of a 63-bp insertion 
(see Sequence ID No. 3) that exists in some of the variants 
is marked as "63 bp INSERT"; the nucleotide sequences that 
are deleted from some of the variants are boxed and labeled 
as "204 bp DELETION," "363 bp DELETION," and "1087 bp 
DELETION. " 

Figure 4 is a schematic representation of various 
human NMDAR2C clones of the invention, with partial 
restriction maps of each clone. The clones are aligned and 
the differences in the DNAs relative to clone NMDA26 are 
5 indicated in the same manner as done in Figure 1. 

Figure 5 is a schematic representation of full- 
length human NMDAR2C subunit subtypes of the invention, 
with partial restriction maps of each DNA. The full-length 
cDNAs are constructed by ligation of appropriate portions 
0 of the clones shown in Figure 4. Regions of each full- 
length CDNA composed of nucleotide sequences corresponding 
to a particular clone are distinguished as solid, striped, 
cross-hatched or open boxes. 

Figure 6 presents restriction maps of CMV 
5 promoter-based vectors pCMV-T7-2 and pCMV-T7-3 . 

nKTATT.ED DESCRIPTION OF THE INVENTION 

In accordance with the present invention, there 
are provided isolated nucleic acids encoding human 
N-methyl-D-aspartate (NMDA) receptor subunit (s). In one 
0 aspect of the present invention, nucleic acids encoding 
NMDA receptor subunit(s) of the NMDAR1 subtype are 
provided. In another aspect, nucleic acids encoding NMDA 



receptor subunit(s) of the NMDAR2 subtype are provided. In 
a further aspect, eukaryotic cells containing such nucleic 
acids, and eukaryotic cells expressing such nucleic acids 
are provided. 

5 Also provided are protein(s) encoded by the 

above-described nucleic acids, as well as antibodies 
generated against the protein(s). In other aspects of the 
present invention, there are provided nucleic acid probes 
comprising at least NMDA receptor subunit-select ive 
10 portions of the above-described nucleic acids. 

As employed herein, the phrase "human N-methyl- 
D-aspartate (NMDA) receptor subunit(s)" refers to 
recombinant^ produced (i.e., isolated or substantially 
pure) proteins which participate in the formation of a 

15 voltage-sensitive cation-selective channel activated by 
exposure to NMDA , and having at least one transmembrane 
domain, a large N-terminal extracellular domain, and the 
like, including variants thereof encoded by mRNA generated 
by alternative splicing of a primary transcript, and 

20 further including fragments thereof which retain one or 
more of the above properties. 

Use of the phrase "recombinantly produced", 
"isolated" or "substantially pure" in the present 
specification and claims as a modifier of DNA, RNA, 

25 polypeptides or proteins means that the DNA, RNA, 
polypeptides or proteins so designated have been produced 
in such form by the hand of man, and thus are separated 
from their native in vivo cellular environment. As a 
result of this human intervention, the recombinant DNAs , 

30 RNAs, polypeptides and proteins of the invention are useful 
in ways that the DNAs, RNAs, polypeptides or proteins as 
they naturally occur are not, such as identification of 
selective drugs or compounds. 
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The term "functional", when used herein as a 
modifier of receptor protein (s) of the present invention, 
means that binding of NMDA (or NMDA-like) ligand to 
receptors comprising the protein (s) causes the receptor 
"ion channels" to open. This allows cations, particularly 
Ca 2+ , as well as Na* and K + , to move across the membrane. 
Stated another way, "functional" means that a signal is 
generated as a consequence of agonist activation of 
receptor protein (s). 

As used herein, a splice variant refers to 
variant NMDA receptor subunit-encoding nucleic acid(s) 
produced by differential processing of primary 
transcript (s) of genomic DNA, resulting in the production 
of more than one type of mRNA. cDNA derived from 
differentially processed primary transcript will encode 
NMDA receptor subunits that have regions of complete amino 
acid identity and regions having different amino acid 
sequences. Thus, the same genomic sequence can lead to the 
production of multiple, related mRNAs and proteins. Both 
the resulting mRNAs and proteins are referred to herein as 
"splice variants". 

Accordingly, also contemplated within the scope 
of the present invention are DNAs that encode NMDA receptor 
subunits as defined above, but that by virtue of degeneracy 
of the genetic code do not necessarily hybridize to the 
disclosed DNA under specified hybridization conditions. 
Such subunits also contribute to the formation of 
functional receptor, as assessed by methods described 
herein or known to those of skill in the art, with one or 
more additional NMDA receptor subunits of the same or 
different type (the presence of additional subunits of a 
different type is optional when said subunit is an NMDAR1 
subunit) . Typically, unless an NMDA receptor subunit is 
encoded by RNA that arises from alternative splicing (i.e., 
a splice variant) , NMDA receptor subunit-encoding DNA and 
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the NMDA receptor subunit encoded thereby share substantial 
sequence homology with at least one of the NMDA receptor 
subunit DNAs (and proteins encoded thereby) described 
herein. It is understood that DNA or RNA encoding a splice 
variant may share less than 90% overall sequence homology 
with the DNA or RNA provided herein, but include regions of 
nearly 100% homology to a DNA fragment described herein, 
and encode an open reading frame that includes start and 
stop codons and encodes a functional NMDA receptor subunit. 

As employed herein, the phrase "NMDA receptor 
subunit (s) of the NMDAR1 subtype" refers to proteins which, 
by hydrophobics analysis of deduced amino acid sequences, 
are believed to contain four or more putative transmembrane 
domains, preceded by a large extracellular N-terminal 
domain. The amino acid sequence typically contains 
possible phosphorylation sites for Ca 2 Vcalmodulin-dependent 
protein kinase type II and protein kinase C [see, for 
example, Kemp et al. (1990) Trends in Biological Science 
Vol. 15:342-346; Kishimoto et al . (1985) J. Biol. Chem. 
Vol. 260:12492-12499; Whittemore et al. (1993) Nature 
364:70-73.]. (These protein kinases reportedly play a 
crucial role in induction and maintenance of long term 
potentiation. ) 

25 T he putative TMII segment (i.e., second 

transmembrane domain) is typically flanked by a glutamic 
acid residue at the extracellular side and a stretch of 
glutamic acid residues at the cytoplasmic side. This 
segment contains an asparagine. residue believed to be 

30 responsible for high Ca 2+ permeability of the NMDAR channel. 
For a summary of NMDAR properties, see Ben-Ari et al., in 
TINS 15:333-339 (1992), especially at p. 334. 

Exemplary DNA sequences encoding human NMDAR1 
subunits are represented by nucleotides which encode 
35 substantially the same amino acid sequence as set forth in 
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Sequence ID Nos. 2, 2E, 2F, 2G, 2H. 21, 2J, 2K, 2L, 2M f 2N, 
or 2 P Presently preferred sequences encode substantially 
the same amino acid sequence as set forth in Sequence ID 
Nos . 2, 2E, 2F, 2G, 2H, 21 or 2P. 

Exemplary DNA can alternatively be characterized 
as those nucleotide sequences which encode a human NMDAR1 
subunit and hybridize under high stringency conditions to 
substantially the entire sequence of any one of Sequence ID 
Nos. 1, 1A, IB, 1C, ID, IE, IF, 1G, 1H, II, U, IK, 1L. 1M, 
IN, or IP, or substantial portions thereof (i.e., typically 
at' least 25-30 nucleotides thereof); preferably exemplary 
DNA will hybridize under high stringency conditions to 
substantially the entire sequence of any one of Sequence ID 
Nos. 1, IE, IF, 1G, 1H, II or IP, or substantial portions 
thereof . 

Stringency of hybridization is used herein to 
refer to conditions under which polynucleic acid hybrids 
are stable. As known to those of skill in the art, the 
stability of hybrids is reflected in the melting 
0 temperature (TJ of the hybrids. T m can be approximated by 
the formula: 

81.5-C - l6.6(log 10 [Na + ]) + 0.41(%G+C) - 600/1, 

where 1 is the length of the hybrids in nucleotides. T ra 
decreases approximately 1-1.5-C with every 1% decrease in 

5 sequence homology. In general, the stability of a hybrid 
is a function of sodium ion concentration and temperature. 
Typically, the hybridization reaction is performed under 
conditions of lower stringency, followed by washes of 
varying, but higher, stringency. Reference to 

0 hybridization stringency relates to such washing 
conditions. Thus, as 'used herein : 
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HIGH STRINGENCY conditions, with respect to 
fragment hybridization, refers to conditions 
that permit hybridization of only those 
nucleic acid sequences that form stable 
hybrids in 0.018M NaCl at 65°C (i.e., if a 
hybrid is not stable in 0.018M NaCl at 65°C, 
it will not be stable under high stringency 
conditions, as contemplated herein). High 
stringency conditions can be provided, for 
example, by hybridization in 50% formamide, 
5X Denharfs solution, 5X SSPE, 0.2% SDS at 
42°C, followed by washing in 0 . IX SSPE, and 
0.1% SDS at 65°C; 

MODERATE STRINGENCY conditions, with respect 
to fragment hybridization, refers to 
conditions equivalent to hybridization in 
50% formamide, 5X Denhart's solution, 5X 
SSPE, 0.2% SDS at 42°C, followed by washing 
in 0.2X SSPE, 0.2% SDS, at 65°C; 

LOW STRINGENCY conditions, with respect to 
fragment hybridization, refers to conditions 
equivalent to hybridization in 10% 
formamide, 5X Denhart's solution, 6X SSPE, 
0.2% SDS at 42°C, followed by washing in IX 
SSPE, 0.2% SDS, at 50°C; and 

HIGH STRINGENCY conditions, with respect to 
oligonucleotide (i.e., synthetic DNA < about 
30 nucleotides in length) hybridization, 
refers to conditions equivalent to 
hybridization in 10% formamide, 5X Denhart's 
solution, 6X SSPE, 0.2% SDS at 42°C, 
followed by washing in IX SSPE, and 0.2% SDS 
at 50°C. 
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It is understood that these conditions may be duplicated 
using a variety of buffers and temperatures and that they 
are not necessarily precise. 

Denhart's solution and SSPE (see, e.g., Sambrook, 
5 Fritsch, and Maniatis, in: Molecular Cloning, A Laboratory 
Manual , Cold Spring Harbor Laboratory Press, 1989) are well 
known to those of skill in the art as are other suitable 

hybridization buffers. For example, SSPE is P H 7.4 

phosphate-buffered 0.18M NaCl. SSPE can be prepared, for 
10 example, as a 20X stock solution by dissolving 175.3 g of 

NaCl, 27.6 g of NaH 2 PO, and 7.4 g EDTA in 800 ml of water, 

adjusting the P H to 7.4, and then adding water to 1 liter. 

Denhart's solution (see, Denhart (1966) Biochem. Biophys. 

Res. Commun. 23:641) can be prepared, for example, as a 50X 
15 stock solution by mixing 5 g Ficoll (Type 400, Pharmacia 

LKB Biotechnology, INC., Piscataway, NJ) , 5 g of 

polyvinylpyrrolidone, 5 g bovine serum albumin (Fraction V; 

Sigma, St. Louis, MO) water to 500 ml and filtering to 

remove particulate matter. 

20 Especially preferred seguences are those which 

have substantially the same nucleotide sequence as the 
coding sequences in any one of Sequence ID Nos. 1, IE, IF, 
1G, 1H, II, U, IK, 1L, 1M, IN, or IP; with those having 
substantially the same sequence as the coding sequence in 

25 Sequence ID Nos. 1, IE, IF, 1G, 1H, II or IP being most 
preferred . 

As used herein, the phrase "substantial sequence 
homology" refers to nucleotide sequences which share at 
least about 90% identity, and amino acid sequences which 
30 typically share more than 95% amino acid identity (>99% 
amino acid identity when dealing with NMDAR1 subunits) . It 
is recognized, however, that proteins (and DNA or mRNA 
encoding such proteins) containing less than the above- 
described level of homology arising as splice variants or 
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that are modified by conservative amino acid substitutions 
(or substitution of degenerate codons) are contemplated to 
be within the scope of the present invention. 

As used herein, the phrase "substantially the 
5 same" refers to the nucleotide sequences of DNA, the 
ribonucleotide sequences of RNA, or the amino acid 
sequences of protein, that have slight and non- 
consequential sequence variations from the actual sequences 
disclosed herein. Species that are "substantially the 

10 same" are considered to be equivalent to the disclosed 
sequences, and as such are within the scope of the appended 
claims. In this regard, "slight and non-consequential 
sequence variations" mean that sequences that are 
substantially the same as the DNA, RNA, or proteins 

15 disclosed and claimed herein, are functionally equivalent 
to the human-derived sequences disclosed and claimed 
herein. Functionally equivalent sequences will function in 
substantially the same manner to produce substantially the 
same compositions as the human-derived nucleic acid and 

20 amino acid compositions disclosed and claimed herein. In 
particular, functionally equivalent DNAs encode human- 
derived proteins that are the same as those disclosed 
herein or that have conservative amino acid variations, 
such as substitution of a non-polar residue for another 

25 non-polar residue or a charged residue for a similarly 
charged residue. These changes include those recognized by 
those of skill in the art as those that do not 
substantially alter the tertiary structure of the protein. 

As employed herein, the phrase "NMDA receptor 
30 subunit(s) of the NMDAR2 subtype" refers to proteins which 
have a large putative extracellular domain at the amino- 
terminal region. Otherwise, the deduced structure of 
NMDAR2 subunits displays the same general characteristics 
as the NMDAR1 subunit structure. A notable typical 
35 exception is that the negatively charged glutamic acid 
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residues that are generally present in the putative TMII 
segment of NMDARl subunits are generally absent from the 
TMII segment of NMDAR2 . Instead, NMDAR2 subunits may 
contain a positively charged lysine residue in TMII. 
5 Unlike NMDARl subunits, NMDAR2 subunits generally do not 
form homomeric NMDA receptors. Moreover, the amino acid 
sequences of NMDARl and NMDAR2 subunits are generally less 
than 50% identical, with identities of less than 30% 
typically observed. 

10 NMDAR2 subunits contemplated by the present 

invention include NMDAR2A, NMDAR2B, NMDAR2C and NMDAR2D 
types of subunits. Exemplary DNA sequences encoding human 
NMDAR2A subunits, or portions thereof, are represented by 
nucleotides which encode substantially the same amino acid 

15 sequence as set forth in Sequence ID No. 11, or 
substantially the same amino acid sequence as that encoded 
by the NMDAR2A-encoding portion of clone NMDA57 , deposited 
with the ATCC under accession number 75442. 

The deposited clone has been deposited at the 

20 American Type Culture Collection (ATCC), 12301 Parklawn 
Drive, Rockville, Maryland, U.S.A. 20852, under the terms 
of the Budapest Treaty on the International Recognition of 
Deposits of Microorganisms for Purposes of Patent Procedure 
and the Regulations promulgated under this Treaty. Samples 

25 of the deposited material are and will be available to 
industrial property offices and other persons legally 
entitled to receive them under the terms of the Treaty and 
Regulations and otherwise in compliance with the patent 
laws and regulations of the United States of America and 

30 all other nations or international organizations in which 
this application, or an application claiming priority of 
this application, is filed or in which any patent granted 
on any such application is granted. In particular, upon 
issuance of a U.S. patent based on this or any application 

35 claiming priority to or incorporating this application by 
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reference thereto, all restriction upon availability of the 
deposited material will be irrevocably removed. 

Exemplary human NMDAR2 A subunit-encoding DNAs can 
alternatively be characterized as those nucleotide 

5 sequences which hybridize under high stringency conditions 
to substantially the entire sequence of Sequence ID No. 10, 
or substantial portions thereof (i.e., typically at least 
25-30 nucleotides thereof) , or the NMDAR2A-encoding portion 
of clone NMDA57 (ATCC accession No. 75442). Especially 

10 preferred sequences encoding human NMDAR2A subunits are 
those which have substantially the same nucleotide sequence 
as the coding sequence of Sequence ID No. 10, or those 
which contain substantially the same nucleotide sequence as 
the coding sequence in the NMDAR2 A-encoding portion of 

15 clone NMDA57 . 

Exemplary DNA sequences encoding human NMDAR2B 
subunits are represented by nucleotides which encode 
substantially the same amino acid sequence as set forth in 
Sequence ID No. 14. Exemplary DNAs can alternatively be 

20 characterized as those nucleotide sequences which encode a 
human NMDAR2B subunit and hybridize under high stringency 
conditions to substantially the entire sequence of Sequence 
ID No. 13, or substantial portions thereof (i.e., typically 
at least 25-30 nucleotides thereof) . Especially preferred 

25 NMDAR2B-encoding sequences are those which have 
substantially the same nucleotide sequence as the coding 
sequence in Sequence ID No. 13. 

Exemplary DNA sequences encoding human NMDAR2C 
subunits are represented by nucleotides which encode 
30 substantially the same amino acid sequence as set forth in 
Sequence ID Nos . 6, 6E, 6F, 6G, 6H or 61. 

Exemplary DNAs can alternatively be characterized 
as those nucleotide sequences which encode a human NMDAR2C 
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subunit and hybridize under high stringency conditions to 
substantially the entire sequence of any one of Sequence ID 
Nos. 5, 5A, 5B, 5C, 5D, 5E # 5F, 5G, 5H, or 51, or 
substantial portions thereof (i.e., typically at least 25- 
5 30 nucleotides thereof) ; preferably exemplary DNA will 
hybridize under high stringency conditions to substantially 
the entire sequence of any one of Sequence ID Nos. 5, 5E, 
5F, or 5G, or substantial portions thereof. 

Especially preferred NMDAR2C-encoding sequences 
10 are those which have substantially the same nucleotide 
sequence as. the coding sequences in any one of Sequence ID 
Nos. 5, 5E, 5F, 5G , 5H or 51; with those having 
substantially the same sequence as the coding sequences in 
Sequence ID Nos. 5, 5E, 5F, or 5G being most preferred. 

Exemplary DNA sequences encoding human NMDAR2D 
subunits are represented by nucleotides which encode 
substantially the same amino acid sequence as set forth in 
Sequence ID No. 16. Exemplary DNAs can alternatively be 
characterized as those nucleotide sequences which encode a 
20 human NMDAR2D subunit and hybridize under high stringency 
conditions to substantially the entire sequence of Sequence 
ID No. 15, or substantial portions thereof (i.e., typically 
at least 25-30 nucleotides thereof). Especially preferred 
NMDAR2D-encoding sequences are those which have 
25 substantially the same nucleotide sequence as the coding 
sequence in Sequence ID No. 15. 

DNA encoding human NMDA receptor subunits may be 
isolated by screening suitable human cDNA or human genomic 
libraries under suitable hybridization conditions with DNA 
30 disclosed herein (including nucleotides derived from any of 
SEQ ID Nos. 1, 1A-1P, 5, 5A-5I, 10, 13 or 15). Suitable 
libraries can be prepared from neuronal tissue samples, 
e.g., hippocampus and cerebellum tissue, cell lines, and 
the like. For example, the library can be screened with a 
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portion of DNA including substantially the entire subunit- 
encoding sequence thereof, or the library may be screened 
with a suitable probe. 

As used herein, a probe is single-stranded DNA or 
5 RNA that has a sequence of nucleotides that includes at 
least 14 contiguous bases that are the same as (or the 
complement of) any 14 or more contiguous bases set forth in 
any of SEQ ID Nos . 1, 1A-1P, 5, 5A-5I, 10, 13 or 15. 
Preferred regions from which to construct probes include 5- 
10 and/or 3- coding sequences, sequences predicted to encode 
transmembrane domains, sequences predicted to encode 
cytoplasmic" loops, signal sequences, NMDA binding sites, 
and the like. 

Either the full-length cDNA clones or fragments 
15 thereof can be used as probes, preferably labeled with 
suitable label means for ready detection. When fragments 
are used as probes, preferably the DNA sequences will be 
from the carboxyl end-encoding portion of the DNA, and most 
preferably will include predicted transmembrane domain- 
20 encoding portions of the DNA sequence (the domains can be 
predicted based on hydropathy analysis of the deduced ammo 
acid sequence using, for example, the method of Kyte and 
Doolittle (1982), J. Mol. Biol. Vol. 152:105). These 
probes can be used, for example, for the identification and 
25 isolation of additional members of the glutamate receptor 
family. 

As a particular application of the invention 
sequences, genetic screening can be carried out using the 
nucleotide sequences of the invention as probes. Thus, 
30 nucleic acid samples from patients having neuropathology! 
conditions suspected of involving alteration/modification 
of any one or more of the glutamate receptors can be 
screened with appropriate probes to determine if any 
abnormalities exist with respect to any of the endogenous 
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glutamate receptors. Similarly, patients having a family 

history of disease states related to glutamate receptor 

dysfunction can be screened to determine if they are also 
predisposed to such disease states. 

5 in accordance with another embodiment of the 

present invention, there is provided a method for 
identifying DNA encoding human N-methy 1-D-aspartate (NMDA) 
receptor protein subunit(s), said method comprising: 

contacting human DNA with a nucleic acid probe as 
10 described above, wherein said contacting is carried out 
under high stringency hybridization conditions, and 

identifying DNA(s) which hybridize to said probe. 

After screening the library, positive clones are 
identified by detecting a hybridization signal; the 
15 identified clones are characterized by restriction enzyme 
mapping and/or DNA sequence analysis, and then examined, by 
comparison with the sequences set forth herein to ascertain 
whether they include DNA encoding a complete NMDA receptor 
subunit (i.e., if they include translation initiation and 
20 termination codons) . If the selected clones are 

incomplete, they may be used to rescreen the same or a 
different library to obtain overlapping clones. If the 
library is genomic, then the overlapping clones may include 
exons and introns . If the library is a cDNA library, then 
25 the overlapping clones will include an open reading frame, 
in both instances, complete clones may be identified by 
comparison with the DNA and encoded proteins provided 
herein. 

Complementary DNA clones encoding various human 
30 NMDA receptor subunits (e.g., NMDAR1 , NMDAR2A, NMDAR2B, 
NMDAR2C, NMDAR2D) have been isolated. Each type of subunit 
appears to be encoded by a different gene. The DNA clones 
provided herein may be used to isolate genomic clones 
encoding each type of subunit and to isolate any splice 
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variants by screening libraries prepared from different 
neural tissues. Nucleic acid amplification techniques, 
which are well known in the art, can be used to locate DNA 
encoding splice variants of human NMDA receptor subunits. 
5 This is accomplished by employing oligonucleotides based on 
DNA sequences surrounding divergent sequence (s) as primers 
for amplifying human RNA or genomic DNA. Size and sequence 
determinations of the amplification products can reveal the 
existence of splice variants. Furthermore, isolation of 
10 human genomic DNA sequences by hybridization can yield DNA 
containing multiple exons, separated by introns, that 
correspond to different splice variants of transcripts 
encoding human NMDA receptor subunits. 

It has been found that not all subunits (and 
15 variants thereof) are expressed in all neural tissues or in 
all portions of the brain. Thus, in order to isolate cDNA 
encoding a particular subunit or splice variants thereof, 
it is preferable to screen libraries prepared from 
different neuronal or neural tissues. Preferred tissues to 
20 use as sources of nucleic acids for preparing libraries to 
obtain DNA encoding each subunit include: hippocampus to 
isolate human NMDARl-encoding DNAs ; hippocampus, cerebellum 
and fetal brain to isolate NMDAR2-encoding DNAs; and the 
like. 

25 Once DNA encoding a subunit has been isolated, 

ribonuclease (RNase) protection assays can be employed to 
determine which tissues express mRNA encoding a particular 
NMDAR subunit subtype or variant. These assays provide a 
sensitive means for detecting and quantitating an RNA 

30 species in a complex mixture of total cellular RNA. The 
subunit DNA is labeled and hybridized with cellular RNA. 
If complementary mRNA is present in the cellular RNA, a 
DNA-RNA hybrid results. The RNA sample is then treated 
with RNase, which degrades single-stranded RNA. Any RNA- 

35 DNA hybrids are protected from RNase degradation and can be 
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visualed by gel electrophoresis and autoradiography. In 
situ hybridization techniques can also be used to determine 
which tissues express mRNA encoding a particular NMDAR 
subunit. The labeled subunit DNAs are hybridized to 
5 different brain region slices to visualize subunit mRNA 
expression . 

The distribution of expression of some human NMDA 
receptor subunits may differ from the distribution of such 
receptors in rat. For example, RNA encoding the rat 

10 NMDAR2C subunit is abundant in rat cerebellum, but is not 
abundant in rat hippocampus [see, e.g., Monyer et al., 
Science 256:1217-1221 (1992)]. Numerous human NMDAR2C 
clones were ultimately obtained, however, from a human 
hippocampus library. Thus, the distribution of some NMDA 

15 receptor subunits in humans and rats appears to be 
different . 

The above-described nucleotide sequences can be 
incorporated into vectors for further manipulation. As 
used herein, vector (or plasmid) refers to discrete 
20 elements that are used to introduce heterologous DNA into 
cells for either expression or replication thereof. 
Selection and use of such vehicles are well within the 
skill of the artisan. 

An expression vector includes vectors capable of 
25 expressing DNAs that are operatively linked with regulatory 
sequences, such as promoter regions, that are capable of 
effecting expression of such DNA fragments. Thus, an 
expression vector refers to a recombinant DNA or RNA 
construct, such as a plasmid, a phage, recombinant virus or 
30 other vector that, upon introduction into an appropriate 
host cell, results in expression of the cloned DNA. 
Appropriate expression vectors are well known to those of 
skill in the art and include those that are replicable in 
eukaryotic cells and/or prokaryotic cells and those that 
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remain episomal or those which integrate into the host cell 
genome. Presently preferred plasmids for expression of 
invention NMDA receptor subunits in eukaryotic host cells, 
particularly mammalian cells, include cytomegalovirus (CMV) 
5 promoter-containing vectors such as P CMV-T7-2 or P CMV-T7-3 
(see Figure 6), pMMTVT7(+) or pMMTVT7(-) (modified versions 
of pMAMneo (Clontech, Palo Alto, CA) , prepared as described 
herein) , pcDNAl, and the like. 

As used herein, a promoter region refers to a 

10 segment of DNA that controls transcription of DNA to which 
it is operatively linked. The promoter region includes 
specific sequences that are sufficient for RNA ^polymerase 
recognition, binding and transcription initiation. This 
portion of the promoter region is referred to as the 

15 promoter. In addition, the promoter region includes 
sequences that modulate this recognition, binding and 
transcription initiation activity of RNA polymerase. These 
sequences may be cis acting or may be responsive to trans 
acting factors. Promoters, depending upon the nature of 

20 the regulation, may be constitutive or regulated. 
Exemplary promoters contemplated for use in the practice of 
the present invention include the SV40 early promoter, the 
cytomegalovirus (CMV) promoter, the mouse mammary tumor 
virus (MMTV) steroid-inducible promoter, Moloney murine 

25 leukemia virus (MMLV) promoter, and the like. 

As used herein, the term "operatively linked" 
refers to the functional relationship of DNA with 
regulatory and effector sequences of nucleotides, such as 
promoters, enhancers, transcriptional and translational 

30 stop sites, and other signal sequences. For example, 
operative linkage of DNA to a promoter refers to the 
physical and functional relationship between the DNA and 
the promoter such that the transcription of such DNA is 
initiated from the promoter by an RNA polymerase that 

35 specifically recognizes and binds to the promoter, and 
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transcribes the DNA . In order to optimize expression 
and/or in vitro transcription, it may be necessary to 
remove, add or alter 5' and/or 3' untranslated portions of 
the clones to eliminate extra, potential inappropriate 
5 alternative translation initiation (i.e., start) codons or 
other sequences that may interfere with or reduce 
expression, either at the level of transcription or 
translation. Alternatively, consensus ribosome binding 
sites (see, for example, Kozak (1991) J. Biol. Chem. 
10 266.: 19867-19870) can be inserted immediately 5' of the 
start codon and may enhance expression. Likewise, 
alternative codons, encoding the same amino acid, can be 
substitutecT'for coding sequences of the NMDAR subunits in 
order to enhance transcription (e.g., the codon preference 
15 of the host cells can be adopted, the presence of G-C rich 
domains can be reduced, and the like). Furthermore, for 
potentially enhanced expression of NMDA receptor subunits 
in amphibian oocytes, the subunit coding sequence can 
optionally be incorporated into an expression construct 
2 0 wherein the 5'- and 3 ' -ends of the coding sequence are 
contiguous with Xenopus £-globin gene 5 ' and 3 ' 
untranslated sequences, respectively. For example, NMDA 
receptor subunit coding sequences can be incorporated into 
vector pSP64T (see Krieg and Melton (1984) in Nucleic Acids 
25 Research 12:7057-7070), a modified form of pSP64 (available 
from Promega, Madison, WI) . The coding sequence is 
inserted between the 5' end of the £-globin gene and the 3' 
untranslated sequences located downstream of the SP6 
promoter. In vitro transcripts can then be generated from 
teh resulting vector. The desirability of (or need for) 
such modification may be empirically determined. 
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As used herein, expression refers to the process 
by which polynucleic acids are transcribed into mRNA and 
translated into peptides, polypeptides, or proteins. If 
35 the polynucleic acid is derived from genomic DNA, 
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expression may, if an appropriate eukaryotic host cell or 
organism is selected, include splicing of the mRNA . 

Particularly preferred vectors for transfection 
of mammalian cells are the P SV2dhfr expression vectors, 
which contain the SV40 early promoter, mouse dhfr gene, 
SV40 polyadenylation and splice sites and sequences 
necessary for maintaining the vector in bacteria, 
cytomegalovirus (CMV) promoter-based vectors such as 
PCMV-T7-2 and pCMV-T7-3 (described herein) or pCDNAl 
(Invitrogen, San Diego, CA) , and MMTV promoter-based 
vectors such as pMMTVT7(+) or pMMTVT7(-), described herein. 



Full-length DNAs encoding human NMDA receptor 
subunits have been inserted into vectors pcDNAl, 
pMMTVT7(+), pCMV-T7— 2 and pCMV-T7-3 . pCMV-T7-2 is a pUC19- 

15 based mammalian cell expression vector containing the CMV 
promoter/enhancer, SV40 splice/donor sites located 
immediately downstream of the promoter, a T7 bacteriophage 
RNA polymerase promoter positioned downstream of the splice 
sites, followed by an SV40 polyadenylation signal and a 

20 polylinker between the T7 promoter and the polyadenylation 
signal. Placement of NMDA receptor subunit DNA between the 
CMV promoter and SV40 polyadenylation signal should provide 
for constitutive expression of the foreign DNA in a 
mammalian host cell transfected with the construct. 

25 Plasmid pCMV-T7-3 is identical to pCMV-T7-2 except that the 
order of restriction enzyme sites in the polylinker is 
reversed . 

Vectors pMMTVT7(+) and pMMTVT7(-) were prepared 
by modifying vector pMAMneo (Clontech, Palo Alto, CA) . 
30 pMAMneo is a mammalian expression vector that contains the 
Rous Sarcoma Virus (RSV) long terminal repeat (LTR) 
enhancer, linked to ' the dexamethasone-inducible mouse 
mammary tumor virus (MMTV) -LTR promoter, followed by SV40 
splicing and polyadenylation sites. pMAMneo also contains 
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the E . coli rieo gene for selection of transf ormants , as 
well as the ^-lactamase gene (encoding a protein which 
imparts ampicillin-resistance) for propagation in E . coli. 

Vector pMMTVT7(+) can be generated by 
5 modification of pMAMneo to remove the neo gene and insert 
the multiple cloning site and T7 and T3 promoters from 
pBluescript (Stratagene, La Jolla, CA) . Thus, pMMTVT7(+) 
contains the RSV-LTR enhancer linked to the MMTV-LTR 
promoter, a T7 bacteriophage RNA polymerase promoter 

10 positioned downstream of the MMTV-LTR promoter, a 
polylinker positioned downstream of the T7 promoter, a T3 
bacteriophage RNA polymerase promoter positioned downstream 
of the T7 promoter, and SV40 splicing and polyadenylation 
sites positioned downstream of the T3 promoter. The 

15 ^-lactamase gene (encoding a protein which imparts 
ampicillin-resistance) from pMAMneo is retained in 
pMMTVT7(+), although it is incorporated in the reverse 
orientation relative to the orientation in pMAMneo. 

Vector pMMTVT7(-) is identical to pMMTVT7(+) 
20 except that the positions of the T7 and T3 promoters are 
switched, i.e., the T3 promoter in pMMTVT7(-) is located 
where the T7 promoter is located in pMMTVT7(+), and the T7 
promoter in pMMTVT7(-) is located where the T3 promoter is 
located in pMMTVT7 (+) . Therefore, vectors pMMTVT7(+) and 
25 pMMTVT7(-) contain all of the regulatory elements required 
for expression of heterologous DNA in a mammalian host 
cell, wherein the heterologous DNA has been incorporated 
into the vectors at the polylinker. In addition, because 
the T7 and T3 promoters are located on either side of the 
30 polylinker, these plasmids can be used for synthesis of in 
vitro transcripts of heterologous DNA that has been 
subcloned into the vectors at the polylinker. 

r 

For inducible expression of human NMDA receptor 
subunit-encoding DNA in a mammalian cell, the DNA can be 
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inserted into a plasmid such as pMMTVT7(+) or pMMTVT7(-). 
These plasmids contain the mouse mammary tumor virus (MMTV) 
promoter for steroid-inducible expression of operatively 
associated foreign DNA. If the host cell does not express 
5 endogenous glucocorticoid receptors reguired for uptake of 
glucocorticoids (i.e., inducers of the MMTV promoter) into 
the cell, it is necessary to additionally transfect the 
cell with DNA encoding the glucocorticoid receptor (ATCC 
accession no. 67200). For synthesis of in vitro 

10 transcripts, full-length human DNA clones encoding human 
NMDAR1, NMDAR2A, NMDAR2B, NMDAR2C and NMDAR2D can also be 
subcloned into pIBI24 (International Biotechnologies, Inc., 
New Haven,' CT) , pCMV-T7-2, pCMV-T7-3, pMMTVT7 ( + ) , 
pMMTVT7(-), pBluescript (Stratagene, La Jolla, CA) or 

15 pGEM7Z (Promega, Madison, WI) . 

In accordance with another embodiment of the 
present invention, there are provided cells containing the 
above-described polynucleic acids (i.e., DNA or mRNA) . 
Such host cells as bacterial, yeast and mammalian cells can 

20 be used for replicating DNA and producing NMDA receptor 
subunit(s) . Methods for assessing receptor expression and 
function are described in PCT Application Nos. 
PCT/US91/05625 and PCT/US92 / 11090 , and in co-pending U.S. 
Application Serial Nos. 07/563,751 and 07/812,254. The 

25 subject matter of these documents is hereby incorporated by 
reference herein in their entirety. 



Incorporation of cloned DNA into a suitable 
expression vector, transfection of eukaryotic cells with a 
plasmid vector or a combination of plasmid vectors, each 

30 encoding one or more distinct genes or with linear DNA, and 
selection of transfected cells are well known in the art 
(see, e.g., Sambrook et al. (1989) Molecular Cloning: A 
Laboratory Manual . Second Edition, Cold Spring Harbor 
Laboratory Press) . Heterologous DNA may be introduced into 

35 host cells by any method known to those of skill in the 
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art, such as transfection with a vector encoding the 
heterologous DNA by CaP0 4 precipitation (see, e.g., Wigler 
et al. (1979) Proc. Natl. Acad. Sci. 76.: 137 3 -137 6 ) or 
lipofectamine (GIBCO BRL #18324-012). Recombinant cells 
5 can then be cultured under conditions whereby the 
subunit(s) encoded by the DNA is (are) expressed. 
Preferred cells include mammalian cells (e.g., HEK293, CHO, 
BHKBI and Ltk" cells, mouse monocyte macrophage P388D1 and 
J774A-1 cells (available from ATCC, Rockville, MD) , and the 
10 like), yeast cells (e.g., methy lotrophic yeast cells, such 
as Pichia pastoris) , bacterial cells (e.g., Escherichia 
coli) , and the like. 

While the DNA provided herein may be expressed in 
any eukaryotic cell, including yeast cells (such as, for 

15 example, P. pastoris (see U.S. Patent Nos. 4,882,279, 
4,837,148, 4,92 9,555 and 4,855,231), Saccharomyces 
cerevisiae, Candida tropicalis , Hansenula polymorpha, and 
the like), mammalian expression systems, including 
commercially available systems and other such systems known 

20 to those of skill in the art, for expression of DNA 
encoding the human NMDA receptor subunits provided herein 
are presently preferred. Xenopus oocytes are preferred for 
expression of in vitro RNA transcripts of the DNA. 

In preferred embodiments, human NMDAR subunit- 
25 encoding DNA is ligated into a vector, and introduced into 
suitable host cells to produce transformed cell lines that 
express a specific human NMDA receptor subtype, or specific 
combinations of subunits. The resulting cell lines can 
then be produced in quantity for reproducible quantitative 
30 analysis of the effects of known or potential drugs on 
receptor function. In other embodiments, mRNA may be 
produced by in vitro, transcription of DNA encoding each 
subunit. This mRNA, either from a single subunit clone or 
from a combination of clones, can then be injected into 
35 Xenopus oocytes where the mRNA directs the synthesis of the 
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human receptor subunits, which then form functional 
receptors. Alternatively, the subun it-encod ing DNA can be 
directly injected into oocytes for expression of functional 
receptors. The transfected mammalian cells or injected 
5 oocytes may then be used in the methods of drug screening 
provided herein. 

Eukaryotic cells in which DNA or RNA may be 
introduced include any cells that are transf ectable by such 
DNA or RNA or into which such DNA or RNA may be injected. 

10 Preferred cells are those that can be transiently or stably 
transfected and also express the DNA and RNA. Presently 
most preferred cells are those that can form recombinant or 
heterologous human NMDA receptors comprising one or more 
subunits encoded by the heterologous DNA. Such cells may 

15 be identified empirically or selected from among those 
known to be readily transfected or injected. 

Exemplary cells for introducing DNA include cells 
of mammalian origin (e.g., COS cells, mouse L cells, 
Chinese hamster ovary (CHO) cells, human embryonic kidney 

20 (HEK) cells (particularly HEK293 cells that can be frozen 
in liquid nitrogen and then thawed and regrown; for 
example, those described in U.S. Patent No. 5,024,939 to 
Gorman (see, also, Stillman et al. (1985) Mol . Cell. Biol. 
5:2051-2060)), African green monkey cells and other such 

25 cells known to those of skill in the art) , amphibian cells 
(e.g., Xenopus laevis oocytes), yeast cells (e.g., 
Saccharomyces cerevisiae , Pichia pastoris) , and the like. 
Exemplary cells for expressing injected RNA transcripts 
include Xenopus laevis oocytes. Cells that are preferred 

30 for transfection of DNA are known to those of skill in the 
art or may be empirically identified, and include HEK293 
(which are available from ATCC under accession #CRL 1573); 
Ltk" cells (which are available from ATCC under accession 
#CCL1.3); COS-7 cells (which are available from ATCC under 

35 accession #CRL 1651) ; and DG44 cells (dhfr* CHO cells; see, 



e.g., Urlaub et al. (1986) Cell. Molec. Genet. 12: 555). 
Presently preferred cells include Ltk" cells and DG44 cells. 

DNA may be stably incorporated into cells or may 
be transiently expressed using methods known in the art. 
Stably transfected mammalian cells may be prepared by 
transfecting cells with an expression vector having a 
selectable marker gene (such as, for example, the gene for 
thymidine kinase, dihydrof olate reductase, neomycin 
resistance, and the like), and growing the transfected 
cells under conditions selective for cells expressing the 
marker gene.. To prepare transient transf ectants , mammalian 
cells are transfected with a reporter gene (such as the E. 
coli fl-galactosidase gene) to monitor transfection 
efficiency. Selectable marker genes are not included in 
the transient transf ections because the transf ectants are 
typically not grown under selective conditions, and are 
usually analyzed within a few days after transfection. 

To produce such stably or transiently transfected 
cells, the cells should be transfected with a sufficient 
concentration of subunit-encoding nucleic acids to form 
human NMDA receptors that contain the human subunits 
encoded by heterologous DNA . The precise amounts and 
ratios of DNA encoding the subunits may be empirically 
determined and optimized for a particular combination of 
subunits, cells and assay conditions. Recombinant cells 
that express NMDA receptors containing subunits encoded 
only by the heterologous DNA or RNA are especially 
preferred. 

Heterologous DNA may be maintained in the cell as 
an episomal element or may be integrated into chromosomal 
DNA of the cell. The resulting recombinant cells may then 
be cultured or subcultured (or passaged, in the case of 
mammalian cells) from such a culture or a subculture 
thereof. Methods for transfection, injection and culturing 
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recombinant cells are known to the skilled artisan. 
Similarly, the human NMDA receptor subunits may be purified 
using protein purification methods known to those of skill 
in the art. For example, antibodies or other ligands that 
5 specifically bind to one or more of the subunits may be 
used for affinity purification and immunoprecipitation of 
the subunit or human NMDA receptors containing the 
subunits . 

As used herein, heterologous or foreign DNA and 

10 RNA are used interchangeably and refer to DNA or RNA that 
does not occur naturally as part of the genome of the cell 
in which itis present or to DNA or RNA which is found in 
a location or locations in the genome that differ from that 
in which it occurs in nature. Typically, heterologous or 

15 foreign DNA and RNA refers to DNA or RNA that is not 
endogenous to the host cell and has been artificially 
introduced into the cell. Examples of heterologous DNA 
include DNA that encodes a human NMDA receptor subunit, DNA 
that encodes RNA or proteins that mediate or alter 

20 expression of endogenous DNA by affecting transcription, 
translation, or other regulatable biochemical processes, 
and the like. The cell that expresses heterologous DNA may 
contain DNA encoding the same or different expression 
products. Heterologous DNA need not be expressed and may 

25 be integrated into the host cell genome or maintained 
episomally . 

Recombinant receptors on recombinant eukaryotic 
cell surfaces may contain one or more subunits encoded by 
the DNA or mRNA encoding human NMDA receptor subunits, or 

30 may contain a mixture of subunits encoded by the host cell 
and subunits encoded by heterologous DNA or mRNA. 
Recombinant receptors may be homomeric or may be a 
heteromeric combination of multiple subunits. Mixtures of 
DNA or mRNA encoding receptors from various species, such 

35 as rats and humans, may also be introduced into the cells. 
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Thus, a cell can be prepared that expresses recombinant 
receptors containing only NMDAR1 subunits, or a combination 
of any one or more NMDAR1 and any one or more NMDAR2 
subunits provided herein. For example, NMDAR1 subunits of 
5 the present invention can be co-expressed with NMDAR2A, 
NMDAR2B , NMDAR2C and/or NMDAR2D receptor subunits. 
Specific examples of heteromeric combinations of 
recombinant human NMDAR subunits that have been expressed 
in Xenopus oocytes include NMDAR1 + NMDAR2A, NMDAR1 + 
10 NMDAR2B , and NMDAR1 + NMDAR2A + NMDAR2C (see Example 9). 

The DNA, mRNA , vectors, receptor subunits, 
receptor subunit combinations and cells provided herein 
permit production of selected NMDA receptor subunits and 
specific combinations thereof, as well as antibodies to 

15 said receptor subunits. This provides a means to prepare 
synthetic or recombinant receptors and receptor subunits 
that are substantially free of contamination from many 
other receptor proteins whose presence can interfere with 
analysis of a single NMDA receptor subtype. The 

20 availability of desired receptor subtypes makes it possible 
to observe the effect of a drug substance on a particular 
receptor subtype or combination of NMDA receptor subunits, 
and to thereby perform initial in vitro screening of the 
drug substance in a test system that is specific for humans 

25 and specific for a human NMDA receptor subtype or 
combination of NMDA receptor subunits. The availability of 
specific antibodies makes it possible to identify the 
subunit combinations expressed in vivo. Such specific 
combinations can then be employed as preferred targets in 

30 drug screening. 

The ability to screen drug substances in vitro to 
determine the effect ^ of the drug on specific receptor 
compositions should permit the development and screening of 
receptor subtype-specific or disease-specific drugs. Also, 
35 testing of single receptor subunits or specific 
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.binations of various types of receptor subunits with a 
variety of potential agonists or antagonists provides 
additional information with respect to the function and 
activity of the individual subunits and should lead to the 
5 identification and design of compounds that are capable of 
very specific interaction with one or more types of 
receptor subunits or receptor subtypes. The resulting 
drugs should exhibit fewer unwanted side effects than drugs 
identified by screening with cells that express a variety 
10 of receptor subtypes. 

Further in relation to drug development and 
therapeutic" treatment of various disease states, the 
availability of DNAs encoding human NMDA receptor subunits 
enables identification of any alterations in such genes 

15 (e.g., mutations) which may correlate with the occurrence 
of certain disease states. In addition, the creation of 
animal models of such disease states becomes possible, by 
specifically introducing such mutations into synthetic DNA 
sequences which can then be introduced into laboratory 

20 animals or in vitro assay systems to determine the effects 
thereof. 

In another aspect, the invention comprises 
functional peptide fragments, and functional combinations 
thereof, encoded by the DNAs of the invention. Such 

25 functional peptide fragments can be produced by those 
skilled in the art, without undue experimentation, by 
eliminating some or all of the amino acids in the sequence 
not essential for the peptide to. function as a glutamate 
receptor. A determination of the amino acids that are 

30 essential for glutamate receptor function is made, for 
example, by systematic digestion of the DNAs encoding the 
peptides and/or by the introduction of deletions into the 
DNAs. The modified (e.g., deleted or digested) DNAs are 
expressed, for example, by transcribing the DNA and then 

35 introducing the resulting mRNA into Xenopus oocytes, where 



translation of the mRNAs will occur. Functional analysis 
of the proteins thus expressed in the oocytes is 
accomplished by exposing the oocytes to ligands known to 
bind to and functionally activate glutamate receptors, and 
then monitoring the oocytes to see if the expressed 
fragments form ion channel (s). If ion channel (s) are 
detected, the fragments are functional as glutamate 
receptors . 

The above-described method can be carried out in 
the presence of NMDAR1- 1 ike receptor subunits alone, or in 
the presence of combinations of NMDAR1 - like and NMDAR2 -1 ike 
receptor subunits. Thus, for example, when the protein 
being tested is an NMDAR2 -1 ike receptor subunit, the 
additional subunit is preferably an NMDAR1-1 ike subunit. 

In accordance with still another embodiment of 
the present invention, there is provided a method for 
identifying compounds which bind to human N-methyl- 
D-aspartate (NMDA) receptor subunit (s), said method 
comprising employing receptor proteins of the invention in 
a competitive binding assay. Such an assay can accomodate 
the rapid screening of a large number of compounds to 
determine which compounds, if any, are capable of binding 
to NMDA receptors. Subsequently, more detailed assays can 
be carried out with those compounds found to bind, to 
further determine whether such compounds act as modulators, 
agonists or antagonists of invention receptors. 

Another application of the binding assay of the 
invention is the assay of test samples (e.g., biological 
fluids) for the presence or absence of receptors of the 
present invention. Thus, for example, serum from a patient 
displaying symptoms related to glutamatergic pathway 
dysfunction can be assayed to determine if the observed 
symptoms are perhaps caused by over- or under-production of 
such receptor (s). 
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The binding assays contemplated by the present 
invention can be carried out in a variety of ways, as can 
readily be identified by those of skill in the art. For 
example, competitive binding assays can be employed, such 
5 as radioreceptor assays, and the like. 

In accordance with a further embodiment of the 
present invention, there is provided a bioassay for 
identifying compounds which modulate the activity of human 
NMDA receptors of the invention, said bioassay comprising: 
10 (a) exposing cells containing DNA encoding human 

NMDA receptor subunit(s) , wherein said cells 
express functional NMDA receptors, to at 
least one compound whose ability to modulate 
the ion channel activity of said receptors 
15 is sought to be determined; and thereafter 

(b) monitoring said cells for changes in ion 
channel activity. 

The above-described bioassay enables the 
identification of agonists and antagonists for human NMDA 

2 0 receptors. According to this method, recombinant NMDA 
receptors are contacted with an "unknown" or test substance 
(in the further presence of a known NMDA agonist, when 
antagonist activity is being tested) , the ion channel 
activity of the known glutamate receptor is monitored 

25 subsequent to the contact with the "unknown" or test 
substance, and those substances which increase or decrease 
the ion channel response of the known glutamate receptor (s) 
are identified as functional ligands (i.e., modulators, 
agonists or antagonists) for human NMDA receptors. 
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In accordance with a particular embodiment of the 
present invention, recombinant human NMDA 

receptor-expressing mammalian cells or oocytes can be 
contacted with a test compound, and the modulating 
effect (s) thereof can then be evaluated by comparing the 



NMDA receptor-mediated response in the presence and absence 
of test compound, or by comparing the response of test 
cells, or control cells (i.e., cells that do not express 
NMDA receptors) , to the presence of the compound. 

As used herein, a compound or signal that 
"modulates the activity of an NMDA receptor 1 ' refers to a 
compound or signal that alters the activity of NMDA 
receptors so that activity of the NMDA receptor is 
different in the presence of the compound or signal than in 
the absence of the compound or signal. In particular, such 
compounds or signals include agonists and antagonists. The 
term agonist refers to a substance or signal, such as NMDA, 
that activates receptor function; and the term antagonist 
refers to a substance that interferes with receptor 
function. Typically, the effect of an antagonist is 
observed as a blocking of activation by an agonist. 
Antagonists include competitive and non-competitive 
antagonists. A competitive antagonist (or competitive 
blocker) interacts with or near the site specific for the 
agonist (e.g., ligand or neurotransmitter). A non- 
competitive antagonist or blocker inactivates the 
functioning of the receptor by interacting with a site 
other than the site that interacts with the agonist. 

As understood by those of skill in the art, assay 
methods for identifying compounds that modulate human NMDA 
receptor activity (e.g., agonists and antagonists) 
generally require comparison to a control. One type of a 
"control" cell or "control 11 culture is a cell or culture 
that is treated substantially the same as the cell or 
culture exposed to the test compound, except the control 
culture is not exposed to test compound. For example, in 
methods that use voltage clamp electrophysiological 
procedures, the same cell can be tested in the presence and 
absence of test compound, by merely changing the external 
solution bathing the cell. Another type of "control" cell 



or "control" culture may be a cell or a culture of cells 
which is identical to the transfected cells, except the 
cells employed for the control culture do not express 
functional human NMDA receptor subunits. In this 

situation, the response of test cell to test compound is 
compared to the response (or lack of response) of receptor- 
negative (control) cell to test compound, when cells or 
cultures of each type of cell are exposed to substantially 
the same reaction conditions in the presence of compound 
being assayed. 

In accordance with yet another embodiment of the 
present invention, the ion channel activity" of human 
N-methyl-D-aspartate (NMDA) receptors can be modulated by 
contacting such receptors with an effective amount of at 
least one compound identified by the above-described 
bioassay . 

In accordance with yet another embodiment of the 
present invention, there are provided antibodies generated 
against the above-described receptor proteins. Such 
antibodies can be employed for studying receptor tissue 
localization, subunit composition, structure of functional 
domains, as well as in diagnostic applications, therapeutic 
applications, and the like. Preferably, for therapeutic 
applications, the antibodies employed will be monoclonal 
antibodies . 

The above-described antibodies can be prepared 
employing standard techniques, as are well known to those 
of skill in the art, using the invention receptor proteins 
or portions thereof as antigens for antibody production. 
Both anti-peptide and anti-fusion protein antibodies can be 
used [see, for example, Bahouth et al. (1991) Trends 
Pharmacol Sci. vol. - 12: 338-343 ; Current Protocols in 
Molecular Biology (Ausubel et al., eds.) John Wiley and 
Sons, New York (1989)]. Factors to consider in selecting 



portions of the NT-1DAR subunits for use as immunogen (as 
either a synthetic peptide or a recombinant ly produced 
bacterial fusion protein) include antigenicity, 
accessibility (i.e., extracellular and cytoplasmic 
domains), uniqueness to the particular subunit, etc. 

The availability of subunit-specif ic antibodies 
makes possible the application of the technique of 
immunohistochemistry to monitor the distribution and 
expression density of various subunits (e.g., in normal vs 
diseased brain tissue). Such antibodies could also be 
employed for diagnostic and therapeutic applications. 

In accordance with still another embodiment of 
the present invention, there are provided methods for 
modulating the ion channel activity of receptor (s) of the 
invention by contacting said receptor (s) with an effective 
amount of the above-described antibodies. 

The antibodies of the invention can be 
administered to a subject employing standard methods, such 
as, for example, by intraperitoneal, intramuscular, 
intravenous, or subcutaneous injection, implant or 
transdermal modes of administration, and the like. One of 
skill in the art can readily determine dose forms, 
treatment regiments, etc, depending on the mode of 
administration employed. 



The invention will now be described in greater 
detail by reference to the following non-limiting examples. 
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Example 1 

TcniAtinn of DNA encoding hum a n NMDA receptor 

NMDAR1 subunits 

A> cDNA Library Screening 

5 RNA isolated from human hippocampus tissue was 

used as a template for the synthesis of oligo dT-primed and 
randomly primed, single-stranded cDNA according to standard 
procedures [see, for example, Maniatis et al. (1982) 
Molecular Cloning: A L aboratory Manual, Cold Spring Harbor 
10 Laboratory Press, Cold Spring Harbor, NY]- The single- 
stranded cDflA was converted to double-stranded^ cDNA, and 
2?coRI/SnaBI/XhoI adaptors were added to the ends thereof. 
The cDNAs were separated by size using agarose gel 
electrophoresis, and those that were >2.0 kb were ligated 
15 into £coRI-digested AgtlO bacteriophage vectors. The 
resulting cDNA library was amplified by replication of each 
clone through limited infection of a bacterial host, and 
stored at -70°C. 



20 



The amplified hippocampus oligo dT-primed cDNA 
library was later retrieved from storage and 1 x 10 



recombinants were screened for hybridization to 
oligonucleotides corresponding to nucleotides 96-128 (SE7) 
and nucleotides 2576-2609 (SE8) of the rat NMDAR1A receptor 
cDNA (see Moriyoshi et al . (1991) Nature 354:31). 
25 Hybridization was performed at 42 "C in 6X SSPE, 5X 
Denhart's solution, 10% formamide, 0.2% SDS and 200 ^g/ml 
herring sperm DNA. Washes were performed in IX SSPE and 
0.2% SDS at 50°C. Hybridizing clones (e.g. NMDA1-3) were 
identified. These clones hybridized to SE8 but not to SE7 . 

30 A randomly primed primary human hippocampus cDNA 

library (-2 x 10 5 recombinants prepared by selecting only 
cDNAs >2.0 kb for inclusion in the library) was screened 
under the same conditions for hybridization to 
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oligonucleotide SE8 and an oligonucleotide corresponding to 
nucleotides 129-141 of the rat NMDAR1A receptor cDNA 
(SEll) . Five hybridizing clones, which hybridized to SE8 
and not to SEll, ware identified: NMDA5-7 and NMDA10-11. 

5 B . Characterization of Clones 

The clones were plague purified and characterized 
by restriction enzyme mapping and DNA seguence analysis of 
the inserts. One of the clones, NMDA11 (see Seguence ID 
No. IB for a description of a portion of NMDAll),is a full- 

10 length cDNA (i.e., it contains translation initiation and 
termination " codons) encoding a complete NMDAR1 subunit. 
The remaining clones are partial cDNAs. Clones NMDA2 , 
NMDA3 (see Seguence ID No. ID), NMDA5, NMDA6, NMDA7 (see 
Seguence ID No. 1C) , and NMDA10 (see Seguence ID No. 1A for 

15 a description of a portion of NMDA10) contain a translation 
termination codon but lack nucleotides at the 5 1 end of the 
coding seguence. 

J 

Characterization of the clones revealed that the 
isolated cDNAs correspond to different alternatively 

20 spliced forms of the human NMDAR1 subunit transcript. The 
four types of alternate splicing represented by the clones 
are depicted schematically in Figure 1. Clone NMDA10 
(which lacks 5 1 untranslated seguences as well as 60 
nucleotides of the 5 f end of the coding seguence) is used 

25 as a reference to which the other variants are compared. 
Clone NMDA11 lacks 363 nucleotides (in the 3' portion of 
the clone) that are present in NMDA10. This 3 63 -nucleotide 
deletion does not disrupt the reading frame of the 
transcript; however, it results in a different termination 

30 codon. The last 69 nucleotides of the coding seguence of 
NMDA11 correspond to 3' untranslated seguence of clone 
NMDA10 (i.e., nucleotides 3325-3393 of Seguence ID No. 1). 
Clone NMDA7 lacks the same 3 63-nucleotide seguence that is 
deleted from NMDA11; however, NMDA7 further lacks 204 
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nucleotides at the 5' end that are present in NMDA10 and 
NMDA11. This 204-nucleotide deletion also does not disrupt 
the reading frame of the transcript. Additionally, NMDA7 
contains a 63-nucleotide in-frame insertion at the 5' end 
5 relative to NMDA10 and NMDA11. The last 69 base pairs of 
the coding sequence of NMDA7 correspond to 3' untranslated 
sequence of NMDA10 i.e., nucleotides 3325-3393 of Sequence 
ID No. 1). Clone NMDA3 lacks 1087 base pairs at the 3' end 
that are present in NMDA10. This 1087-base pair deletion 
10 does not disrupt the reading frame of the transcript; 
however it results in a different termination codon. The 
last 231 base pairs of the coding sequence of NMDA3 
correspond "\o 3' untranslated sequence of clone NMDA10 
(i.e., nucleotides 4049-4279 in Sequence ID No. 1). 

15 Example 2 

Pre Baration of full-lengt h NMDAR1 subunit cDNA constructs 

Portions of clones NMDA10 , NMDA11, NMDA7 and 
NMD A 3 were ligated together to construct full-length cDNAs 
encoding variants of the NMDA receptor NMDAR1 subunit. The 
20 full-length NMDAR1 subunit cDNAs were incorporated into 
vector pcDNAl (Invitrogen, San Diego, CA) for use in 
expressing the receptor subunits in mammalian host cells 
and for use in generating in vitro transcripts of the DNAs 
to be expressed in Xenopus oocytes. 

25 Vector pcDNAl is a pUC19-based plasmid that 

contains the following elements in the 5 ' -to-3 ' order: the 
cytomegalovirus (CMV) immediate early gene 

promoter/enhancer, the bacteriophage T7 RNA polymerase 
promoter, a polylinker, the bacteriophage SP6 RNA 

30 polymerase promoter, SV40 RNA processing (i.e., splice 
donor/acceptor) signals, SV40 polyadeny lation signal, and 
the ColEl origin and supF suppressor tRNA to permit 
maintenance of the vector in Escherichia coli strains with 
the P3 episome. This vector thus contains all the 



39 



regulatory elements required for expression of heterologous 
DNA in a mammalian host cell, wherein the heterologous DNA 
has been incorporated into the vector at the polylinker. 
In addition, because the T7 and SP6 promoters are located 
on either side of the polylinker, this plasmid can be used 
for synthesis of in vitro transcripts of heterologous DNA 
that has been sublconed into the vector at the polylinker. 



A. NMDARIA 

Full-length construct NMDARIA was prepared by 
10 ligation of a 5 ' portion of NMDA11 (beginning _ 5 • of the 
translation" initiation codon and extending to the HindHI 
site in the middle of the clone) and a 3 • portion of NMDA10 
(beginning at the HindHI site in the middle of the clone 
and extending 3' of the translation termination codon) as 
15 depicted in Figure 2. The two DNA fragments were joined in 
mammalian expression vector pcDNAl. 

Initially, the strategy for generating the NMDARl 
construct involved a first step of separately subcloning 
the entire 4 . 0 kb EcoRI insert fragment of NMDA10 and the 

20 entire 4 . 0 kb SnaBI insert fragment of NMDA11 into pcDNAl; 
however, two attempts employing this cloning strategy were 
unsuccessful. It appeared that there may have been 
selection against E. coli hosts retaining the complete 
insert fragments since the surviving recombinant E. coli 

25 that were analyzed contained incomplete insert cDNAs from 
which nucleotides had been deleted. Therefore, it was 
necessary to prepare the full-length NMDARIA construct in 
several steps by subcloning and combining various fragments 
of NMDA10 and NMDA11 in pcDNAl as follows (see Figure 3 for 

30 locations of restriction enzyme sites) . 

r 

Clone NMDA10 was digested with Bglll and EcoRI 
and the -3.3 kb fragment containing nucleotides 1020-4298 
of Sequence ID No. 1 was isolated and subcloned into 
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BamHI/iTcoRI-digested pcDNAl. The resulting plasmid was 
digested with HindUl and Nhel and the fragment containing 
nucleotides 2137-4298 of Sequence ID No. 1 plus a portion 
of pcDNAl was isolated. 

5 Clone NMDA11 was digested with EcoRl and Hindi I I 

and the -2.1 kb fragment containing nucleotides 1-2136 of 
Sequence ID No. 1 was isolated and subcloned into 
i?coRI/HindIII-digested modified pcDNAl (modified by 
deletion of the HindUl site located 5' of the EcoRI site 
10 in the polylinker and addition of a HindUl site into the 
polylinker -at a position 3- of the FcoRI site). The 
resulting plasmid was digested with Nhel and HindUl and 
the fragment containing nucleotides 1-2136 of Sequence ID 
No. 1 plus a portion of modified pcDNAl was isolated. This 
15 Whel/HindHI fragment was then ligated to the Hindlll/Nhel 
fragment containing nucleotides 2137-4298 of Sequence ID 
No. 1 to generate the full-length construct NMDAR1A (see 
Figure 2). The ligation mix was used to transform E. coli 
strain MC1061/P3. Because the Hhel site in pcDNAl occurs 
20 within the supF selection gene, only E. coli containing the 
correctly ligated, complete NMDAR1A plasmid (which has the 
complete, functional selection gene) were able to survive 
the selection process. This fragment subcloning strategy 
enabled selection of the desired correct NMDARlA-containing 
25 E. coli host cells, even though the total number of such 
recombinant host cells was small. 

In summary, construct NMDAR1A contains 2 61 base 
pairs of 5' untranslated sequence from NMDAR11 (nucleotides 
1-261 of Sequence ID No. 1) and a complete coding sequence 
30 (nucleotides 262-3078 of Sequence ID No.l) for the NMDAR1A 
variant of the NMDAR1 subunit as well as 1220 base pairs of 
3' untranslated sequence (nucleotides 3079-4298 of Sequence 
ID No. 1) . The NMDARlA-encoding sequence is operatively 



41 

linked to the regulatory elements in pcDNAl for express! 
in mammalian cells. 



B. NMDAR1 —A3 63 

Full-length construct NMDAR1-A3 6 3 was prepared by 
5 ligation of a 5 ' portion of NMDA11 (beginning 5' of the 
translation initiation codon and extending to the Hindlll 
site in the middle of the clone, i.e., nucleotides 1-2136 
in Sequence ID No. 1) and a 3 ' portion of NMDA11 (beginning 
at the Hindlll site in the middle of the clone and 
10 extending 3' of the translation termination codon, i.e., 
nucleotides~2137-2961 and 3325-4298 of Sequence'lD No. 1). 
As described above, due to the difficulty in directly 
subcloning the entire 4.0 kb SnaBI NMDA11 insert into 
pcDNAl, it was necessary to generate the construct by 
15 ligating two fragments of the NMDA11 insert into pcDNAl as 
follows (see Figure 3 for locations of restriction enzyme 
sites) . 

To obtain the 5' NMDA11 fragment, clone NMDA11 
was digested with £coRI and Hindlll and the -2.2 kb 

20 fragment containing nucleotides 1-2136 of Sequence ID No. 
1 was isolated and subcloned into £coRI/HindIII-digested 
modified pcDNAl (modified as described above) . The 
resulting plasmid was digested with Whel and Hindlll and 
the fragment containing nucleotides 1-2136 of Sequence ID 

25 No. 1 plus a portion of modified pcDNAl was isolated. 

To obtain the 3' NMDA11 fragment, clone NMDA11 
was digested with Bgill and JJcoRI and the 3.0 kb fragment 
containing nucleotides 1020-2961 and 3325-4298 of Sequence 
ID No. 1 was isolated and subcloned into BamHI/ScoRI- 
30 digested pcDNAl. The, resulting plasmid was digested with 
Hindlll and Nhel and the fragment containing nucleotides 
2137-2961 and 3325-4298 of Sequence ID No. 1 plus a portion 
of pcDNAl was isolated. This Hindlll/Nhel fragment was 
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then ligated to the Nhel /Hindi 1 1 fragment containing 
nucleotides 1-2136 of Sequence ID No. 1 to generate NMDAR1- 
A363 . 

In summary, construct NMDAR1— A3 63 contains 261 
5 base pairs of 5' untranslated sequence (nucleotides 1-261 
of Sequence ID No. 1) and a complete coding sequence for 
the NMDAR1-A3 63 variant NMDAR1 subunit (nucleotides 262 
2961 and 3325-3393 of Sequence ID No. 1) as well as 905 
base pairs of 3' untranslated sequence (nucleotides 3394- 
10 4298 of Sequence ID No. 1). Thus, NMDAR1— A3 6 3 differs from 
NMDAR1 in that it lacks 363 nucleotides (nucleotides 2962- 
3324 of Sequence ID No. 1) that comprise the last 117 
nucleotides of the coding sequence and the first 246 
nucleotides of the 3' untranslated sequence of NMDAR1 - The 
15 NMDAR1-A3 63 subunit variant-encoding sequence is 
operatives linked to the regulatory elements in pcDNAl for 
expression in mammalian cells. 

C. MMDA'RI -A1087 

• Full-length construct NMDAR1-A1087 was prepared 
20 by replacing the 3' end of the NMDAR1 variant-encoding 
insert of NMDAR1-A3 63 with a fragment from the 3' end of 
clone NMDA3 (see Figure 2). Plasmid NMDAR1-A3 63 was 
partially digested with PstI and completely digested wrth 
Xbal. There is a PstI site -112 nucleotides upstream of 
25 the location of the 363-nucleotide deletion in NMDAR1-A3 63 
and an Xbal site in the polylinker located downstream of 
the 3' untranslated sequence of NMDAR1-A3 63 (see Figure 3). 
Thus, Pstl/Xbal digestion of NMDAR1-A3 63 results in removal 
of a fragment containing nucleotides 2850-2961 and 3325- 
30 4298 of Sequence ID No. 1 from the vector. The larger 
fragment was isolated, from the digest. 

The insert of clone NMDA3 was cloned into the 
EcoRI restriction site(s) of pGEM (Promega, Madison, WI) ; 



and the resulting plasmid was digested with PstI and Xbal . 
The smaller fragment containing nucleotides 2850-2961 and 
4049-4298 of Sequence ID No. 1 was isolated and ligated to 
the larger fragment from the Pstl/Xbal digest of NMDAR1- 
A363. The resulting construct was designated NMDAR1-A1087 . 

In summary, NMDAR1-A1087 contains 261 base pairs 
of 5' untranslated sequence (nucleotides 1-261 in Sequence 
ID No. 1), the complete coding sequence for the NMDAR1- 
A1087 variant NMDAR1 subunit (nucleotides 262-2961 and 
4049-4279 of Sequence ID No. 1) and 19 base pairs of 3' 
untranslated sequence (nucleotides 4280-4298 of Sequence ID 
No. 1). Thus, NMDAR1-A1087 differs from NMDAR1 in that it 
lacks 1087 nucleotides (nucleotides 2962-4048 of Sequence 
ID No. 1) that comprise the last 117 nucleotides of the 
coding sequence and the first 970 nucleotides of the 3' 
untranslated sequence of NMDAR1 . The NMDAR1-A1087 subunit 
variant-encoding sequence is operatively linked to the 
regulatory elements in pcDNAl for expression in mammalian 
cells . 

D. • NMDAR1-T63-A204 

Full-length construct NMDAR1-I63-A204 was 
prepared by replacing a 13 99-nucleotide fragment of 
construct NMDAR1A (i.e, nucleotides 738-2136 of Sequence ID 
No. 1) with the PvuII-Hindlll fragment of NMDA7 (i.e., 
nucleotides 738-831 of sequence ID No. 1, plus nucleotides 
1-63 of Sequence ID No. 3 and nucleotides 832-984 and 1189- 
2136 of Sequence ID No. 1), as depicted in Figure 2. 
Because there are multiple PvuII sites in the NMDAR1 
construct, a several-step process was required for 
construction of NMDAR1-I63-A204 as follows (see Figure 3 
for the location of restriction enzyme sites) . 

The -2.2-kb EcoRI-Hindlll fragment isolated from 
construct NMDAR1A and containing nucleotides 1-2136 of 
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Sequence ID No. 1 was ligated with modified pcDNAl 
(modified as described in Example 2A) that had been 
digested with EcoRI and Hindlll. The resulting plasmid was 
digested with Avrll and self-ligated to remove two PvuII 
5 sites from a portion of the plasmid contributed by pcDNAl. 
The plasmid was then partially digested with PvuII and 
completely digested with Hindlll. The digest was ligated 
with a 1258-nucleotide PvuII-Hindlll fragment isolated from 
clone NMDA7. The resulting plasmid, designated NMDAR1-I63- 
10 A204-5', was digested with BamHI and tfindlll and the -2-kb 
fragment containing nucleotides 1-831 of Sequence ID No. 1, 
plus nucleotides 1-63 of Sequence ID No. 3 and nucleotides 
832-984 and 1189-2136 of Sequence ID No. 1 was isolated and 
ligated to BamHI/Hindlll-digested NMDAR1 to generate 
15 NMDAR1-I63-A204 . 

NMDAR1-I63-A204 contains 261 base pairs of 5' 
untranslated sequence (nucleotides 1-261 in Sequence ID 
No. 1), the complete coding sequence for the NMDAR1-I63- 
A204 variant NMDAR1 subunit (nucleotides 262-831 of 
20 Sequence ID No. 1 plus nucleotides 1-63 of Sequence ID No. 
3 and nucleotides 832-984 and 1189-3078 of Sequence ID No. 
1) and 1220 base pairs of 3' untranslated sequence 
(nucleotides 3079-4298 of Sequence ID No. 1) . Thus NMDAR1- 
I63-A204 differs from NMDAR1 in that it contains 63 
25 nucleotides that are not present in NMDAR1 (nucleotides 1- 
63 of Sequence ID No. 3) located between nt 831 and 832 of 
Sequence ID No. 1. Further, NMDAR1-I63-A204 lacks 204 
nucleotides that are present in NMDAR1 (nucleotides 985- 
1188 of Sequence ID No. 1). The NMDAR1-1 63 -A2 04 subunit 
30 variant-encoding sequence is operatively linked to the 
regulatory elements in pcDNAl for expression in mammalian 
cells . 



10 



20 



45 

NMDAR1-I63 
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Full-length construct NMDAR1-I63 can be described 
as NMDAR1 in which a 173-bp fragment (nucleotides 738-910 
of Sequence ID No. 1) is replaced with the 236-bp PvuII- 
Smal fragment of NMDA7 (nucleotides 738-831 of Sequence ID 
No. 1, plus nucleotides 1-63 of Sequence ID No. 3 and 
nucleotides 832-910 of Sequence ID No. 1). Because there 
are multiple PvuII sites in the NMDAR1 construct, a 
several-step process was required for construction of 
NMDAR1-I63 as follows. Plasmid NMDAR1-I63-A204-5 ' was 
partially digested with Smal and completely digested with 
Hindlll. The larger vector fragment was ligated with the 
1226-bp Smal/HindHI fragment isolated from NMDA11 
(nucleotides 911-2136 of Sequence ID No. 1) . The resulting 
15 vector was digested with BamHI and Hindlll and the -2.2-kb 
fragment containing nucleotides 1-831 of Sequence ID No. 1, 
plus nucleotides 1-63 of Sequence ID No. 3 and nucleotides 
832-2136 of Sequence ID No. 1 was isolated and ligated to 
BamHI/ffindlll-digested NMDAR1 to generate NMDAR1-I63. 



NMDAR1-I63 contains 261 base pairs of 5' 
untranslated sequence (nucleotides 1-261 in Sequence ID No. 
1) , the complete coding sequence for the NMDAR1-I63 variant 
NMDAR1 subunit (nucleotides 262-831 of Sequence ID No. 1, 
plus nucleotides 1-63 of Sequence ID No. 3 and nucleotides 
25 832-3078 of Sequence ID No. 1) and 1220 nucleotides of 3' 
untranslated sequence (nucleotides 3079-4298 of Sequence ID 
No. 1). Thus, NMDAR1-I63 differs from NMDAR1 in that it 
contains 63 nucleotides that are not present in NMDAR1 
(nucleotides 1-63 of Sequence ID No. 3), located between 
nucleotides 831 and 832 of Sequence ID No. 1. The 
NMDAR1-I63 subunit variant-encoding sequence is operatively 
linked to the regulatory elements in pcDNAl for expression 
in mammalian cells. 
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F. NMDAR1-I6 3-A2 04-A3 6 3 

Full-length construct NMDAR1-I 6 3 -A2 04 -A3 63 was 
prepared by replacing the 2861 nucleotide fragment from 
construct NMDAR1-I63-A2 04 (ie, nucleotides 1438-4298 
5 Sequence ID. No. 1) with the KpnI-Xbal (polylinker site) 
fragment of NMDAR1-A3 6 3 (ie, nucleotides 1438-2961 and 
3325-4298 of Sequence ID No. 1) as depicted in Figure 2. 
The NMDAR1-I63-A204 was completely digested with Xbal then 
partially digested with Kpnl due to the presence of two 
10 additional Kpnl sites in the vector sequence. The 
resulting 5i NMDAR1-I63-A204 fragment, which includes the 
pcDNAI vector sequences, was ligated with the 3 f Kpnl-Xbal 
fragment from NMDAR1-A3 6 3 to generate NMDAR1-I63 -A2 04 -A3 63 . 

In summary, construct NMDAR1-I63-A204-A3 63 

15 contains 261 base pairs of 5 1 untranslated sequence 
(nucleotides 1-261 in Sequence ID No. 1), the complete 
coding sequence for the NMDAR1-I63-A204-A363 variant 
NMDAR1A subunit (nucleotides 262-831 of Sequence ID No. 1, 
plus nucleotides 1-63 of Sequence ID No. 3, plus 

20 nucleotides 832-984, 1189-2961 and 3325-3393 of Sequence ID 
No. 1) as well as 905 base pairs of 3 f untranslated 
sequence (nucleotides 3394-4298 of Sequence ID. No. 1) . 
Thus, NMDAR1-I63-A204-A363 differs from NMDAR1A in that it 
contains 63 nucleotides that are not present in NMDAR1A 

25 (nucleotides 1-63 of Sequence ID No. 3) located between 
nucleotides 831 and 832 of Sequence ID No. 1. Further, 
NMDAR1-I63-A204-A363 lacks 204 nucleotides that are present 
in NMDAR1A (nucleotides 985-1188 -of Sequence ID No. 1) and 
363 nucleotides that are present in NMDAR1A (nucleotides 

30 2962-3324 of Sequence ID No. 1) that comprise the last 117 
nucleotides of the coding sequence and the first 246 
nucleotides of the 3 1 untranslated sequence of NMDAR1A. 
The NMDAR1-I63-A204-A363 subunit variant encoding sequence 
is operatively linked to the regulatory elements in pcDNAI 

35 for expression in mammalian cells. 
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G . NMr)ARl-I *t-A2 04-Al08 7 

Full-length construct NMDAR1-I63-A204-A1087 was 
prepared by replacing the 2861 nucleotide fragment from 
construct NMDAR1-I 63-A2 04 (ie, nucleotides 1438-4298 
Sequence ID. N. 1) with the KpnI-Xbal (polylinker site) 
fragment of NMDAR1-A1087 (ie, nucleotides 1438-2961 and 
4049-4298 of Sequence ID No. 1) as depicted in Figure 2. 
The NMDAR1-I63-A204 was completely digested with Xbal then 
partially digested with Kpnl due to the presence of two 
additional Kpnl sites in the vector sequence. The 
resulting 5' NMDAR1-I63-A204 fragment, which includes the 
pcDNAI vector sequences, was ligated with the 3' KpnI-Xbal 
fragment from NMDAR1-A1087 to generate NMDAR1-I63-A204- 
A1087 . 

In summary, construct NMDAR1-I63-A204-A1087 
contains 261 base pairs of 5' untranslated sequence 
(nucleotides 1-261 in Sequence ID No. 1), the complete 
coding sequence for the NMDAR1-I63-A204-A363 variant 
NMDAR1A subunit (nucleotides 262-831 of Sequence ID No. 1, 
plus nucleotides 1-63 of Sequence ID No. 3, plus 
nucleotides 832-984, 1189-2961 and 4280-4298 of Sequence ID 
No. 1) as well as 19 base pairs of 3' untranslated sequence 
(nucleotides 4280-4298 of Sequence ID. No. 1) . Thus, 
NMDAR1-I63-A204-A1087 differs from NMDAR1A in that it 
contains 63 nucleotides that are not present in NMDAR1A 
(nucleotides 1-63 of Sequence ID No. 3) located between 
nucleotides 831 and 832 of Sequence ID No. 1. Further, 
NMDAR1-I63-A204-A1087 lacks 204 nucleotides that are 
present in NMDAR1A (nucleotides 985-1188 of Sequence ID No. 
1) and 1087 nucleotides that are present in NMDAR1A 
(nucleotides 2962-4048 of Sequence ID No. 1) that comprise 
the last 117 nucleotides of the coding sequence and the 
first 970 nucleotides' of the 3' untranslated sequence of 
NMDAR1A. The NMDAR1-I63-A204-A1087 subunit variant 
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encoding sequence is operatively linked to the regulatory 
elements in pcDNAI for expression in mammalian cells. 



H. 



Additional Constructs Containing Full-Length 

cDNAs Encoding Variants of the NM0AR1 Subunit 



10 
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Additional full-length cDNAs encoding further 
possible NMDAR1 variants can be constructed using methods 
similar to those described in Examples 2A-G above. 
Specifically, the following constructs can be prepared by 
ligating portions of clones NMDA11, NMDA10, NMDA7 and NMDA3 
as depicted in Figure 2: 



NMDAR1-A204 
NMDAR1-A2 04-A3 6 3 
NMDAR1-I63-A3 63 
NMDAR1-I63-A108 7 
NMDAR1-A204-A1087 



(Sequence ID No. 1J) 

(Sequence ID No. IK) 

(Sequence ID No. 1M) 

(Sequence ID No. IN) 

(Sequence ID No. 1L) 



The full-length cDNAs can also be incorporated into 
mammalian expression vectors such as pcDNAI, as described 
in Examples 2A-G. 

Several methods can be employed to determine 
20 which NMDAR1 subunit variants are actually expressed in 
various human tissues. For example, oligonucleotides 
specific for the nucleotide sequences located 5' and 3 1 of 
the insertions and deletions of the NMDAR1 transcripts 
described herein can be used to prime nucleic acid 
25 amplifications of RNA isolated from various tissues and/or 
cDNA libraries prepared from various tissues. The presence 
or absence of amplification products and the sizes of the 
products indicate which variants are expressed in the 
tissues. The products can also be characterized more 
3 0 thoroughly by DNA sequence analysis. 
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RNase protection assays can also be used to 
determine which variant transcripts are expressed in 
various tissues. These assays are a sensitive method for 
detecting and quantitating an RNA species in a complex 
5 mixture of total cellular RNA. A portion of the NMDAR1 
subunit variant DNA is labeled and hybridized with cellular 
RNA. If complementary mRNA is present in the cellular RNA, 
a DNA-RNA hybrid results. The RNA sample is then treated 
with RNase, which degrades single-stranded RNA. Any RNA- 
10 DNA hybrids are protected from RNase degradation and can be 
visualed by gel electrophoresis and autoradiography. 

Further information on possible splice variants 
of the NMDAR1 primary transcript can be obtained by 
isolation of genomic clones containing NMDAR1 subunit- 
15 encoding sequences (for example, by hybridization to the 
human NMDAR1 subunit cDNAs disclosed herein) and subsequent 
characterization of the resulting clones. 



Example 3 

Isolation of DNA Encoding Human NMDA Receptor 
2 0 NMDAR2C Subunits 



Degenerate oligonucleotides were synthesized 
based on two conserved regions of rat NMDAR2A, NMDAR2B and 
NMDAR2C DNAs that encode the putative first and fourth 
transmembrane domains. In rat NMDAR2A DNA, these regions 

25 are encoded by nucleotides 1669-1692 (oligo SE74) and 2437- 
2465 (olig SE75) , respectively. [see Monyer et al . (1992) 
Science 256 : 1217 -1221] . These oligonucleotides were used 
to prime nucleic acid amplification of cDNAs prepared from 
RNA isolated from human hippocampus, cerebellum, and 

30 orbitof rontal tissue. Two products, a 795-bp and a 640-bp 
fragment, were detected when the reaction mixture was 
analyzed by gel electrophoresis and ethidium bromide 
staining. The 795-bp fragment amplified from the 

cerebellum cDNA was subcloned into PCR1000 (Invitrogen, San 



50 



Diego, CA) and characterized by DNA sequence analysis, 
which' revealed that it is -86% similar to the rat NMDAR2 A 
DNA sequence, -78% similar to the rat NMDAR2B DNA sequence, 
and -74% similar to the rat NMDAR2C DNA sequence. Thus, 
5 this plasmid was named pcrNMDAR2 A . 

The 795-bp insert from pcrNMDAR2 A was used to 
screen 1 x 10 6 recombinants of a human hippocampus cDNA 
library (prepared by using random primers to synthesize 
cDNAs from hippocampus tissue and selecting fragments >2.0 

10 kb for insertion into ylgtlO vectors) and a human cerebellum 
cDNA library (random-primed library size-selected for 
fragments > 2 . 8 kb in ^gtlO) . Hybridization was performed 
in 5X SSPE, 5X Denhart ' s solution, 50% deionized f ormamide, 
0.2% SDS, 200 ng/ml sonicated, denatured herring sperm DNA 

15 at 42°C. Washes were performed in IX SSPE, 0.2% SDS at 
55°C. The probe hybridized to 11 plaques from the 
hippocampus library and 8 plaques from the cerebellum 
library. 

DNA sequence analysis and/or restriction enzyme 
20 mapping of 15 of the hybridizing plaques that were purified 
surprisingly revealed that they were more similar to rat 
NMDAR2C DNA than to rat NMDAR2A DNA. All of the clones 
were partial cDNAs (i.e., they lacked a translation 
initiation and/or termination codon) and were designated as 
25 NMDAR2C cDNAs . Comparison of the clones revealed that the 
human NMDAR2C subunit transcript is differentially 
processed . 

Clones NMDA26, NMDA24 , NMDA22 and NMDA21 (see 
Figure 4) represent four basic clones that were identified, 
30 all of which are believed to be splice variants. Clone 
NMDA26 (Sequence ID No. 5D) is used as a reference to which 
the other variants can be compared. Clone NMDA24 (Sequence 
ID No. 5C) contains a 24-bp sequence (see Sequence ID No. 
7) that is not present in NMDA26. Clone NMDA22 (Sequence 
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ID No. 5B) lacks 15 bp that are present in NMDA26, and 
clone NMDA21 (Sequence ID No. 5A) lacks 51 bp that are 
present in NMDA26. Clones NMDA22 and NMDA24 both contain 
an 11-bp sequence (Sequence ID No. 9) that is not present 
5 in NMDA26 (between nucleotides 1116-1117 of Sequence ID No. 
5) . Introduction of this sequence into these clones 
(between nucleotides 1116-1117 of Sequence ID No. 5) 
disrupts the reading frame of the transcript and introduces 
a premature translation termination (i.e., STOP) codon into 
10 the transcript. 

Clones NMDA26 and NMDA27 (see Figure 4) are 
partial NMDAR2C cDNAs that contain 5' untranslated 
sequence, a translation initiation codon and some of the 
coding sequence. Clone NMDA26 contains 188 base pairs of 

15 5 1 untranslated sequence whereas clone NMDA27 contains -1.1 
kb of 5' untranslated sequence. The sequences of the 5 1 
untranslated regions of these two clones are identical for 
the first 15 nucleotides proceeding 5 1 of the translation 
initiation codon. However, beginning with the 16th 

20 nucleotide 5* of the translation initiation codon, the 
sequences of the two clones diverge (compare nucleotides 
116-191 of Sequence ID No. 5 to nucleotides 1 - 74 of 
Sequence ID No. 12). 

Example 4 

2 5 Preparation of Full-length NMDAR2C Subunit cDNA Constructs 

Portions of the partial NMDAR2C clones can be 
ligated in a variety of ways to generate constructs 
encoding full-length NMDAR2C subunit variants. The 5» end 
of each NMDAR2C cDNA can be contributed by NMDA26, whereas 

3 0 the 3' ends of the constructs are contributed by various 

combinations of clones NMDA21, NMDA2 2, and NMDA24. Figure 
5 depicts full-length .NMDAR2C constructs and indicates the 
portions of the different clones that contribute to each 
construct . 
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For example, full-length constructs can be 
prepared using methods such as those described in Example 
2 for preparing NMDAR1 constructs. Thus, clone inserts are 
transferred into a vector (e.g., pcDNAl) for ease of 
5 manipulation and then desired portions of the cDNAs are 
isolated by restriction enzyme digestion of the vectors. 
This can require several steps and/or partial digests if, 
for example, there are no unique restriction enzyme sites 
surrounding the desired portions of the cDNAs . The desired 
10 cDNA fragments are then ligated and incorporated into an 
expression plasmid such as pcDNAl or pCMV-T7-2. 

Plasmid pCMV-T7-2 (see Figure 6) is a pUC19-based 
vector that contains a cytomegalovirus (CMV) 
promoter/ enhancer, SV40 splice donor/splice acceptor sites 

15 located immediately downstream of the promoter, a T7 
bacteriophage RNA polymerase promoter positioned downstream 
of the SV40 splice sites, an SV40 polyadeny lation signal 
downstream of the T7 promoter, and a poly linker between the 
T7 promoter and the polyadeny lation signal. This vector 

20 thus contains all the regulatory elements required for 
expression of heterologous DNA in a mammalian host cell, 
wherein the heterologous DNA has been incorporated into the 
vector at the polylinker. In addition, because the T7 
promoter is located just upstream of the polylinker, this 

25 plasmid can be used for synthesis of in vitro transcripts 
of heterologous DNA that has been subcloned into the vector 
at the polylinker. Plasmid pCMV-T7-3, also depicted in 
Figure 6, is identical to pCMV-T7-2 except that the order 
of the restriction enzyme sites in the polylinker is 

30 reversed. This plasmid can also -be used for heterologous 
expression of NMDAR subunit DNA. 

Construct pcDNAl-26-NotI-24-5 ' UT contains 188 
base pairs of 5' untranslated sequence (nucleotides 1-188 
of Sequence ID No. 5), the complete coding sequence of the 
35 first variant of the human NMDAR2C subunit (nucleotides 
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189-3899 of Sequence ID No. 5) and -440 base pairs of 3' 
untranslated sequence (nucleotides 3900-4340 of Sequence ID 
No. 5). The NMDAR2C cDNA is contained within the 

polylinker of expression vector pcDNAl for expression. 

5 Construct pCMV-2 6-NotI-2 4 (Sequence ID No. 5) 

contains 49 base pairs of 5' untranslated sequence 
(nucleotides 140-188 of Sequence ID No. 5), the complete 
coding sequence of a first variant of the human NMDAR2C 
subunit (nucleotides 189-3899 of Sequence ID No. 5) and 
10 -440 base pairs of 3' untranslated sequence (nuceotides 
3900-4340 of Sequence ID No. 5). The NMDAR2C cDNA is 
contained within the polylinker of expression vector pCMV- 
T7-2 for expression. 

Construct P CMV-26-ScaI-24 (Sequence ID No. 5E) is 
15 identical to P CMV-26-NotI-24 , except it contains 24-base 
pairs (Sequence ID No. 7) inserted between nucleotides 2350 
and 2351 of Sequence ID No. 5. 

Construct P CMV-26-ScaI-22 (Sequence ID No. 5F) is 
identical, to P CMV-26-NotI-24 , except that it lacks 15-base 
20 pairs (nucleotides 1960-1974 of Sequence ID No. 5). 

Construct pCMV-2 6-ScaI-2 l-NotI-24 (Sequence ID 
No. 5G) is identical to P CMV-26-NotI-24 , except that it 
lacks 51-base pairs (nucleotides 2351-2401 of Sequence ID 
No. 5) . 
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Construct NMDAR2C-A15-I24 (Sequence ID No. 5H) is 
identical to P CMV-26-NotI-24 , except that it lacks 15-base 
pairs (i.e., nucleotides 1960-1974 of Sequence ID No. 5) 
and includes a 24-base pair sequence (i.e., Sequence ID No. 
7; inserted between nucleotides 2350 and 2351 of Sequence 
30 ID No. 5) . 
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Construct NMDAR2C-A15-A51 (Sequence ID No. 51) is 
identical to P CMV-26-NotI-24 , except that it lacks 15-base 
pairs (i.e., nucleotides 1960-1974 of Sequence ID No. 5) 
and 51-base pairs (i.e., nucleotides 2351-2401 of Sequence 
5 ID No. 5) . 

Additional" full-length NMDAR2C constructs can 
readily be prepared as described herein. For example, 5' 
untranslated sequence obtained from NMDA27 (instead of 
NMDA26) can be employed, and the 3' ends of the constructs 
10 can be contributed by various combinations of clones 
NMDA21, NMDA2 2, and NMDA2 4 . 

Several methods (e.g., nucleic acid 

amplification, RNase protection assays, etc.), as described 
in Example 2, can be employed to determine which NMDAR2C 
15 subunit variants are actually expressed in various human 
tissues . 

Human NMDAR2C has 83.5% GC nucleotide content 
between nucleotides 2957 and 3166. To potentially enhance 
NMDAR2D subunit expression, the GC content in this region 
20 can be reduced while maintaining the native amino acid 
sequence. Synthetic DNAs can be made by oligonucleotide 
primer extension across this region. Four 
oligonucleotides, SE343 (Sequence ID No. 17), SE344 
(Sequence ID No. 18), SE345 (Sequence ID No. 19), and SE346 
25 (Sequence ID No. 20) were synthesized. These primers 
maintain the amino acid sequence of the human NMDAR2D 
receptor and some restriction sites, but lower the overall 
GC content of this region to 53.4%. The criteria for the 
modification of bases were: 1) to not have more than 4 
30 guanine nucleotides in a row if at all possible, 2) to 
maintain the restriction cutting sites for NotI 
(nucleotides 2962 - 2969 of Sequence ID No. 5), Avail 
(nucleotides 3069 - 3073 Sequence ID No. 5), and Aatll 
(nucleotides 3156 - 3161 of Sequence ID No. 5), 3) to 
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reduce the secondary structure of the oligonucleotides as 
much as possible, 4) to not introduce any additional Notl, 
Avail or Aatll restriction sites into the sequence and 5) 
to have the basepair overlap between oligonucleotide pairs, 
5 {SE343 and SE344} or {SE345 and SE346} have a proposed 
melting temperature between 62-66°C. The oligonucleotide 
pair SE343 and SE344 have complementary sequence from 
nucleotides 51 - 71 of Sequence ID Nos. 17 and 18. The 
oligonucleotide pair SE345 and SE346 have complementary 
10 sequence from nucleotides 42 - 61 of Sequence ID No. 19 and 
nucleotides 43 - 62 of Sequence ID No. 20, resepectively . 

The primer pairs, {SE343 and SE344} and {SE345 
and SE346}, are combined in a standard PCR reaction 
mixture, which contains 50 pmoles of each oligonucleotide, 
15 and are amplified according to the following PCR protocol: 

Annealing temperature of 55 °C for 1 
min, extension temperature of 72 °C for 
2 min and melting temperature, 96 °C for 
30 seconds for 30 cycles. 

20 The resulting 121 bp PCR product from the primer pair 
SE343-SE344 is digested with Notl and Aval, and the 
resulting 103 bp PCR product from the primer pair SE345- 
SE346 is digested with Aval and Aatll. These fragments are 
ligated into P CMV-NMDAR2C-26-NotI-24 , which has been 

25 partially digested with both Notl and Aatll due to the 
presence of additional Notl and/or Aatll restriction sites 
in the vector sequence, to form P CMV-26-NotI-24-GCMOD. 
This construct, P CMV-26-NotI-24-GCMOD, contains nucleotides 
140-2965 of Sequence ID No. 5, followed by the 195 

30 nucleotides set forth in Sequence ID No. 21, and then 
nucleotides 3161 to 4340 of Sequence ID. No. 5. 
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Example 5 

Tcnl.tion of DNA Encod ing Human NMDA Receptor 
NMDAR2 A Subunits 

Two human cDNA libraries were prepared using 
different oligonucleotides (random and specific primers) to 
prime cDNA synthesis from RNA isolated from cerebellum 
tissue. The specific primer used for first-strand 

synthesis was SE162, nucleotides 904 to 929 of Sequence ID 
No. 10. cDNAs synthesized by random priming that ranged in 
size from 1.0-2.8 kb, and cDNAs synthesized by specific 
priming that ranged in size from 0.6-1.1 kb, were isolated 
and inserted into the AqtlO phage vector to generate the 
two libraries. 

The random-primed library (3 x 10 6 recombinants) 
was screened for hybridization to the 795-base pair insert 
from pcrNMDAR2A (see Example 3) in 5X SSPE, 5X Denhart ' s 
solution, 50% deionized formamide, 0.2% SDS, 200 /xg/ml 
sonicated, denatured herring sperm DNA at 42°C. Washes 
were performed in IX SSPE, 0.2% SDS at 55°C. The probe 
hybridized to 11 plaques. 

The specifically-primed library (6 x 10 5 
recombinants) was screened for hybridization to 
oligonucleotide SE177 (nucleotides 859 to 884 of Sequence 
ID No. 10) in 6X SSPE, 5X Denhart ' s solution, 10% deionized 
formamide, 0.2% SDS, 200 ng/ml sonicated, denatured herring 
sperm DNA at 42 °C. Washes were performed in IX SSPE, 0.2% 
SDS at 50°C. The probe hybridized to 2 plaques. 

Nine of the hybridizing plaques were purified and 
the inserts were characterized by restriction enzyme 
mapping and DNA sequence analysis. All clones contained 
partial cDNAs . Two of the clones, NMDA53 and NMDA54 , 
contain the translation initiation codon and 320 base pairs 
and 88 base pairs, respectively, of 5' untranslated 
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sequence. The sequences of four other clones, NMDA47, 
NMDA49, NMDAR50 and NMDA5 1 , along with those of NMDA53 and 
NMDA54 , overlap to comprise -70% of the human NMDAR2 A 
subunit coding sequence (see nucleotides 1 - 3084 of 
Sequence ID No. 10) . 

To obtain clones containing the remaining -1300 
base pairs of 3' sequence needed to complete the NMDAR2A 
coding sequence, 6.6 x 10 6 recombinants of an additional 
human cDNA library (an amplified randomly primed cerebellum 
cDNA library with inserts ranging from 1.0 - 2.8 kb in 
length) were screened for hybridization to an 
oligonucleotide corresponding to the 3' end of clone NMDA51 
(oligo SE171; nucleotide 3454 to 3479 of Sequence ID No. 
10) using the same conditions as used for screening the 
specifically primed cerebellum cDNA library as described 
above. Four hybridizing plaques were purified and the 
inserts were characterized by DNA sequence analysis to 
determine if they contain the 3' end of the coding sequence 
and a translation termination codon. Two of the clones 
(NMDA57 and NMDA58, which were determined to be identical), 
contain a. translation termination codon, as determined by 
DNA sequence analysis. Phage lysate containing clone 
NMDA57 were deposited under the provisions of the Budapest 
Treaty with the American Type Culture Collection (ATCC) on 
April 13, 1993, and assigned Accession No. 75442. 

Example 6 

Preparation of Full-length NMDAR2 A Subunit cDNA Constructs 

Two separate constructs encoding a full-length 
NMDAR2A subunit ( p CM V - h N M D A R 2 A - 1 ( 5 3 ) and 

pCMV-hNMDAR2A-2 (54) were prepared by ligating portions of 
the following partial NMDAR2A clones: NMDAR47, NMDAR50, 
NMDAR58 and either NMDAR53 or NMDAR54 (NMDAR53 and NMDAR54 
differ only in the amount of 5' untranslated sequence 
contained in the clones. The inserts of clones NMDA47, 
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NMDA50 and NMDA58 were isolated as EcoRI fragments and 
ligated with ScoRI-digested pCMV-T7-2 to create pNMDA4 7 , 
pNMDASO and pNMDA58 , respectively. The inserts of clones 
NMDA53 and NMDA54 were isolated as Xhol fragments and 
ligated with Sal I-digested pCMV-T7-2 to create pNMDA53 and 
pNMDA54, respectively. 

pNMDA47 was digested with Seal and Nsil to 
liberate an -3,350-bp fragment containing a 3' portion of 
the ^-lactamase gene, which encodes a protein which imparts 
ampicillin-resistance, and nucleotides 824-2415 of Sequence 
ID No. 10. This fragment was ligated with a -2890-bp 
Nsil/Scal fragment of pNMDA50 (containing a 5' portion of 
the /?-lactamase gene and nucleotides 2416-3346 of Sequence 
ID No. 10) to generate pNMDA47+50. 

The portion of pNMDA58 that encodes the 3 * end of 
NMDAR2A contains two Mscl sites. Because the 3' Mscl site 
is cleaved in preference to the 5' Mscl site, partial 
digestion of pNMDA5 8 was not an option. Thus, pNMDA58 was 
digested with Seal /Mscl, and the -2020-bp fragment 
containing a 5' portion of the ^-lactamase gene and a 3' 
portion of the insert (nucleotides 4751-4808 of Sequence ID 
No. 10) was isolated. This fragment was ligated to a 
-4150-bp Seal /Mscl fragment of P NMDA47+50 (containing a 3' 
portion of the ^-lactamase gene and nucleotides 824-3212 of 
Sequence ID No. 10) to generate pNMDA47+50+3 1 END58 . This 
plasmid contained a complete ^-lactamase gene and 
nucleotides 824-3214 and 4751-4808 of Sequence ID No. 10. 
To add nucleotides 343-4750 of Sequence ID No. 10 to 
pNMDA47 + 50 + 3 ' END58 , pNMDA5 8 was digested with Mscl, and the 
isolated 1537-bp fragment consisting of nucleotides 3213- 
4750 of Sequence ID No. 10 was ligated to Mscl-digested 
pNMDA47+50+3 • END58 . The resulting plasmid, pNMDA47+50+58 , 
contained nucleotides 824-4808 of Sequence ID No. 10. 
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To generate two constructs containing identical 
NMDAR2A coding sequences but differing amounts of 5' 
untranslated sequence, pNMDA53 and pNMDA54 were digested 
with ScaI/£coRI to liberate fragments containing a 3' 
5 portion of the ^-lactamase gene and nucleotides 1-854 and 
225-854 of Sequence ID No. 10, respectively. pNMDA4 7 + 50 + 58 
was digested with- Scal/JScoRI (partial) and the 3954-bp 
fragment containing a 5' portion of the ^-lactamase gene 
and nucleotides 855-4808 of Sequence ID No. 10 was 
10 separately ligated with the Scal/EcoRl fragments of pNMDA53 
and pNMDA54 to generate P CMV-hNMDAR2 A-l ( 53 ) and pCMV- 
hNMDAR2A— 2 (54) , respectively. These two constructs are 
identical except for the amount of 5' untranslated sequence 
contained in each. Both contain a full-length NMDAR2A- 
15 encoding sequence (nucleotides 311-4705 of Sequence ID No. 
10) and 103 nucleotides of 3' untranslated sequence 
(nucleotides 4706-4808 of Sequence ID No. 10) . pCMV- 
hNMDAR2A— 1 (53 ) contains 310 nucleotides of 5' untranslated 
sequence (nucleotides 1-310 of Sequence ID No. 10), whereas 
20 P CMV-hNMDAR2A-2(54) contains 87 nt of 5 ' untranslated 
sequence (nucleotides 224-310 of Sequence ID No. 10). The 
NMDAR2A cDNA is operatively linked to the regulator 
elements of pCMV-T7-2 for expression in mammalian host 
cells . 

25 There is no unique restriction site 3' of the 

NMDAR2 A— specif ic DNA in pCMV-hNMDAR2A-l ( 53 ) that can be 
used to linearize the plasmid in order to prepare in vitro 
transcripts for injection into Xenopus oocytes. To make a 
construct that has a unique 3' restriction site (pCMV- 

30 hNMDAR2A-3 (53) ) , essentially the entire NMDAR2A— specif ic 
DNA of P CMV-hNWDAR2A-l(53) was transferred into vector 
pCMV-T7-3 as follows. pCMV-NMDAR2 A-l ( 53 ) was digested with 
NotI and the -4.4-kb fragment was isolated and ligated with 
Notl-digested pCMV-T7-3 to generate pCMV-hNMDAR2 A— 3 (53) . 
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Example 7 

Isolation of DNA Encoding Human NMDA Receptor 
NMDAR2B Subunits 

A human fetal brain AZAP cDNA library (1 x 10 6 
5 recombinants; Stratagene, La Jolla, CA) was screened for 
hybridization to a DNA fragment containing the entire rat 
NMDAR2B subunit coding sequence (see Monyer et al. (1992) 
Science 256 : 1217-1221) . Hybridization was conducted in 50% 
deionized formamide, 5X Denhart's solution, 5X SSPE, 200 

10 fig/ml sonicated, denatured herring sperm DNA and 0.2% SDS 
at 42°C. Washes were performed in 0 . 5X SSPE, 0.2% SDS at 
65°C. One of the hybridizing clones excised from the human 
fetal brain library, NMDA81, containing a 5,435 bp insert 
and translation initiation and termination codons, encodes 

15 a full-length NMDAR2B subunit. This excised plasmid, which 
is in the pBluescript vector, was called pBS-hNMDAR2B . 

NMDA81 was digested with EcoRI/EcoRV and the 
~5.5-kbp fragment was isolated and ligated to EcoRl/EcoRV- 
digested pCMV-T7-3. The resulting construct, pCMVPL3- 

20 hNMDAR2B , contains the NMDAR2B coding sequence (nucleotides 
210-4664 of Sequence ID No. 13) , as well as 209 nucleotides 
of 5 1 untranslated sequence (nucleotides 1-209 of Sequence 
ID No. 13) and 339 nucleotides of 3' untranslated sequence 
(nucleotides 4665-5003 of Sequence ID No. 13) . The 

25 NMDAR2B-encoding DNA in this construct is operatively 
linked to regulatory elements in pCMV-T7-3 for expression 
in mammalian host cells. 



Example 8 

Isolation of DNA Encoding Human NMDA 
3 0 Receptor NMDAR2D subunits 



A human fetal brain cDNA library (1 x 10 6 
recombinants; Stratagene, La Jolla, CA) was screened by 
subtraction screening methods for DNA encoding a human 



NMDAR2D receptor subunit. In this method, plaques were 
selected on the basis of weak or no hybridization to DNAs 
:oding human NMDAR2A, NMDAR2B and NMDAR2C subunits. 
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Initially, the library was screened for 
hybridization to pcrNMDAR2A (see Example 3) under low- 
stringency conditions (30% deionized formamide, 5X 
Denharf s solution, 5X SSPE, 200 ng/ml sonicated herring 
sperm DNA, 0.2% SDS at 42°C). Washes were also performed 
using low-stringency conditions (2X SSPE, 0.2% SDS, 50°C). 
The filters were stripped, then screened for hybridization 
to the pcrNMDAR2 A fragment and to an -1200 bp PstI fragment 
of DNA encoding a human NMDAR2B subunit (see Example 7) and 
an -950 bp AccI fragment of DNA encoding a human NMDAR2C 
subunit (see Example 3). These fragments contain DNA 
encoding all of the putative transmembrane domains of the 
subunits. Hybridization was performed under high- 

stringency conditions (50% deionized formamide, 5X 
Denhart's solution, 5X SSPE, 200 ng/ml sonicated herring 
sperm DNA, 0.2% SDS at 42°C) as were washes (0.1X SSPE, 
0. 1% SDS, 65°C) . 

Eighteen of the plaques that hybridized weakly to 
pcrNMDAR2A in the initial low stringency screening of the 
library hybridized only weakly or not at all to portions of 
DNA encoding human NMDAR2A, NMDAR2B and NMDAR2C subunits in 
the high stringency screening. The plaques were purified, 
and the insert fragments were characterized by DNA sequence 
analysis. One of the inserts, NMDA96, corresponds to the 
3 1 half of the human NMDAR2D subunit gene coding sequence. 
The sequence of this clone is provided in Sequence ID No. 
15. 

To obtain clones containing the remaining -2000 
bp of 5 1 sequence needed to complete the NMDAR2D subunit 
coding sequence, the human fetal brain cDNA library was 
screened for hybridization to an -831 bp Swal fragment of 



the clone containing the 3' half of the NMDAR2D coding 
sequence under high stringency hybridization and washing 
with 0.5X SSPE, 0.2% SDS at 65°C. Nine hybridizing plaques 
were purified and analyzed by DNA sequencing, which 
revealed that none of the plaques contain DNA encoding a 
translation initiation codon and extending 3' to at least 
the 5' end of the clone containing the 3' half of the 
NMDAR2D coding sequence. 

A human cDNA library was prepared using a 
specific oligonucleotide, SE296, to prime cDNA synthesis 
from RNA isolated from human fetal brain. The specific 
primer used for first-strand synthesis was SE296 
(nucleotides 2920-2949 of Sequence ID No. 15). cDNAs 
synthesized by specific priming that were greater than 2.2 
kb in size were isolated and inserted into the AzhPU phage 
vector to generate the library. 

6 

The specifically primed library (1 x 10 
recombinants) was screened for hybridization to the 831 bp 
Smal fragment from NMDAR2D (nucleotides 2435-3265 of 
Sequence ID No. 15) in 5X SSPE, 5X Denhart's solution, 50% 
deionized formamide, 0.2% SDS, 200 ng/ml sonicated, 
denatured herring sperm DNA at 42°C. Washes were performed 
in 0.1X SSPE, 0.2% SDS at 65°C. One probe hybridized to 11 
plaques . 

Eleven of the hybridizing plagues were purified, 
and the inserts characterized by restriction enzyme mapping 
and DNA sequence analysis. Six of the clones (NMDA111, 
NMDA112, NMDA115, NMDA116, NMDA119 and NMDA121) contain the 
translation initiation codon and varying amounts of 5' 
untranslated sequence. 



The sequences of these clones overlap with NMDA96 
to constitute 100% of the human NMDAR2D subunit coding 
sequence (see nucleotides 485-4495 of Sequence ID No. 15) . 
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The full-length hNMDAR2D construct was prepared 
using NMDA1 15 and NMDA96 cDNAs . NMDA115 and NMDA96 cDNAs 
are already in the pBlueScript vector, however the NMDA115 
cDNA is in the sense orientation from the T7 promoter, 
5 while the NMDA96 cDNA is in the antisense orientation. For 
ease of subcloning the full-length construct, the NMDA96 
cDNA was cloned into the sense orientation by digesting 
NMDA96 with EcoRl and screening the resulting clones for 
orientation (NMDAR96-T7) . Within the complete human 

10 NMDAR2D sequence, there is a unique Hindlll at nucleotides 
2804 that was used to clone NMDA115 together with NMDA96. 
However, there is an additional Hindlll site in the pBS 
polylinker at the 5' end of the NMDA115 cDNA. Therefore 
NMDA115 was fully digested with Spel , a 3' polylinker site, 

15 and partially digested with Hindlll. The resulting -5.6 kb 
Spel-Hindlll fragment from pNMDA115 (pBS vector plus 
nucleotides 397-2804 of Sequence ID No. 15)) was ligated 
with the 1.7 kb Hindlll-Spel fragment (nucleotides 2805- 
4651 of Sequence ID No. 15) from NMDA96-T7 to form pBS- 

20 hNMDAR2D. In vitro transcripts were prepared for co- 
injection into Xenopus oocytes to test for alteration of 
NMDAR1A currents. 

The complete NMDAR2D insert is then transfered 
into the pMMTV-T7+ mammalian expression vector as a -4.7 kb 
25 EcoRV /Spel fragment. The £*coRV and Spel restriction sites 
are in the multiple cloning region of the pBluscript 
vector . 

In summary, construct NMDAR2D contains 88 base 
pairs of 5' untranslated sequence (nucleotides 397-484 in 
30 Sequence ID No. 15), the complete coding sequence for the 
NMDAR2D subunit (nucleotides 484-4495 of Sequence ID No. 
15) as well as 200 base pairs of 3 1 untranslated sequence 
(nucleotides 4496-4695 of Sequence ID No. 15) . The NMDAR2D 
subunit encoding sequence is operatively linked to the 
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regulatory elements in pMMTV-T7 for expression in mammalian 
cells . 

Example 9 

Expression of Recombinant Human NMDA 
5 Receptor Subunits on Oocytes 

Xenopus oocytes were injected with in vitro 
transcripts prepared from constructs containing DNA 
encoding human NMDA receptor NMDAR1 and NMDAR2 subunits. 
Electrophysiological measurements of the oocyte 
10 transmembrane currents were made using the two-electrode 
voltage clamp technique (see e.g., Stuhmer (1992) Meth. 
Enzymol* 207:319-339). 

A. Preparation of In Vitro Trans cripts 



Recombinant capped transcripts of NMDA receptor 
15 subunit cDNAs contained in constructs NMDAR1A, NMDAR1-I63, 
NMDAR1-I63-A204, NMDAR1-A1087 , NMDAR1-A363, and pCMV-26- 
NotI-24 were synthesized from linearized plasmids using the 
mCAP RNA Capping Kit (Cat. #200350, Stratagene, Inc., La 
Jolla, CA) . For experiments in which NMDAR2A or NMDAR2B 
20 and NMDAR1 or NMDAR1-I63 transcripts were co-injected into 
Xenopus oocytes, the transcripts were synthesized from 
linearized constructs NMDAR1A , NMDAR1-I63, pCMV- 
hNMDAR2A-3 (53) , pCMV-2 6-Not 1-2 4 and pBS-hNMDAR2B using 
mMessage mMachine (Ambion, catalog #134 4 , Austin, TX) . The 
25 mass of each synthesized transcript was determined by UV 
absorbance and the integrity of each transcript was 
determined by electrophoresis through an agarose gel. 

B. ElectrophvsiolQQV 
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Xenopus oocytes were injected with 12.5-50 ng of 
one or more NMDA receptor subunit transcripts per oocyte. 
The preparation and injection of oocytes were carried out 



as described by Dascal [(1987) Crit . Rev. Biochem. 22:317- 
387]- Two-to-six days following mRNA injection, the 
oocytes were examined using the two-electrode voltage clamp 
technique. The cells were bathed in Ringer's solution (115 
mM NaCl, 2.5 mM KCl, 1.8 mM CaCl 2 , 10 mM HEPES , pH 7.3) , and 
the membrane potential was clamped at -80 to -100 mV. 
Drugs were applied by pipetting 6.0 /il aliquots of drug- 
containing solution directly into the bath, or by using 
gravity-feed into a Warner Instruments chamber (volume = 
110 /il) at a fl° w rate of 8 ml/min. The data were sampled 
at 2-5 Hz with a Labmaster data acquisition board in a PC- 
386 using AXOTAPE version 1.2 (Axon Instruments, Foster 
City, CA) software. The data were exported to a laser 
printer or plotted using Sigmaplot version 5.0. 

NMDA agonists, i.e., 10-30 /zM glycine (gly) and 
10-100 /iM glutamate (glu) or 100-1000 mM NMDA , were applied 
to the bath. If a current response was observed, the 
agonists were washed from the bath and 0.1-1.0 mM MgCl 2 or 
1 /iM MK8 01 (Research Biochemicals , Inc., Natick, MA) (NMDA 
receptor antagonists) were applied before a second agonist 
application in order to determine whether the current was 
blocked by antagonists. Alternatively, MgCl 2 or MK-801 were 
applied during agonist-induced current flow. The results 
of multiple recordings are summarized in Table 1. 
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Analysis of the results shown in Table 1 
indicates that, in general, the NMDA agonist-induced 
currents were blocked by either MgCl 2 or MK801. 



Oocytes injected with transcripts (12.5 to 65 ng) 
5 of the NMDAR-1 subunit-encoding inserts of constructs 
NMDAR1A, NWDAR1-I63 or NMDAR1-A363 were further analyzed to 
evaluate human NMDA receptor sensitivity to glutamate and 
NMDA. The two-electrode voltage clamp methods described 
above were used to measure current in the cells. 



10 To determine glutamate and NMDA sensitivity of 

the recombinant human NMDA receptors, various 
concentrations of glutamate (0.1 - 100 /iM) or NMDA (3-1000 
jiM) were applied to the bath (in the presence of 10-30 /iM 
glycine) and the current response was recorded. The bath 

15 was flushed between agonist applications. Intermediate 
test applications of 10 /iM glycine plus 10 /iM glutamate 
were included in the experiments to monitor the receptors 
for run-down (i.e., inactivation of receptors that have 
been repeatedly activated during prolonged 

20 electrophysiological recording) . The data were used to 
generate dose-response curves from which EC 50 values for the 
two agonists were calculated. Glycine sensitivity was 
determined in the same manner except that various 
concentrations (0.1-100 /iM) of glycine were co-applied with 

25 100 AiM NMDA. 



The EC 50 values determined for glutamate 
stimulation of NMDA receptors expressed in oocytes injected 
with NMDAR1A, NMDAR1-I63 or NMDAR1-A3 63 transcripts were 
0.4, 0.6 and 0.5 /iM, respectively. The EC 50 values 
30 determined for NMDA stimulation of NMDA receptors expressed 
in oocytes injected with NMDAR1A , NMDAR1-I63 or NMDAR1-A3 6 3 
transcripts were 6.3, 10.9 and 11.9 /iM, respectively. 
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There was a marked potentiation of the current 
magnitude in response to glutamate and glycine in oocytes 
co-injected with in vitro transcripts of pCMV- 
hNMDAR2A-3 (53) and NMDAR1A or NMDAR1-I63 compared to the 
5 currents recorded in oocytes injected with transcripts of 
either NMDAR1A or NMDAR1-I63 alone. Similarly, there was 
a marked potentiation of the current magnitude in response 
to glutamate and glycine in oocytes co-injected with in 
vitro transcripts of NMDAR1A and pBS-hNMDAR2B compared to 
10 the currents recorded in oocytes injected with only the 
NMDAR1A transcript. 

To investigate the pharmacological properties of 
human NMDA receptors generated by coexpression of the human 
NMDAR1A, NMDAR2A and NMDAR2C subunits, oocytes were co- 

15 injected with 50 ng each of in vitro transcripts prepared 
from the NMDAR1A, pCMV-hNMDAR2A-3 , and pCMV-2 6-NotI-24 
(NMDAR2C) constructs. The sensitivity of the recombinant 
heteromeric receptors to glycine and NMDA was determined as 
described above. The EC 50 for glycine activation of inward 

20 currents in these recombinant oocytes was calculated from 
the dose-response curve to be 0.87 + 0.24 /iM (mean ± S.D. 
of 4 oocytes) , which was significantly different than the 
EC 50 calculated for glycine sensitivity of oocytes injected 
with 50 ng each of in vitro transcripts of NMDAR1A and 

25 pCMV— hNMDAR2 A-3 alone (1.9 ± 0.26 fiM , ; p = 0.0002, one- 
tailed t-test) . The sensitivity to NMDA also increased 
when human NMDAR2C was co-expressed with human NMDAR1A and 
NMDAR2A subunits. The EC 50 for NMDA was shifted from 30.2 
± 9.4 £iM for oocytes co-injected with 50 ng each of in 

30 vitro transcripts of NMDAR1A and pCMV-hNMDAR2 A-3 to 11.9 ± 
5.2 for oocytes co-injected with 50 ng each of in vitro 
transcripts of NMDAR1A , pCMV-hNMDAR2 A-3 and pCMV-2 6-NotI-24 
(mean ± S.D. of 4 oocytes). 



Example 10 

Recombinant Expression of Human NMDA Receptor Subuni 

in Mammalian Cells 



Mammalian cells, such as human embryonic kidney 
(HEK293) cells can be transiently and/or stably transfected 
with DNA encoding human NMDA receptor subunits (e.g., DNA 
encoding an NMDAR1 subunit or DNA encoding an NMDAR1 
subunit and DNA encoding an NMDAR2 subunit such as pCMV-26- 
NotI-24, pCMV-hNMDAR2A-3 (53) or pCMVPL3 -hNMDAR2 B ) . 

Transf ectants are analyzed for expression of NMDA receptors 
using various assays, e.g., northern blot hybridization, 
electrophysiological recording of cell currents, ca 
sensitive fluorescent indicator-based assays and [ H]-MK801 
binding assays. 

A. Transient Transfection of HEK Cells 

Two transient transf ections were performed. In 
one transfection, HEK 293 cells were transiently 
transfected with DNA encoding an NMDAR1 (construct NMDAR1A) 
subunit. In another transfection, HEK 293 cells were 
transiently co-transf ected with DNA encoding NMDAR1 
(construct NMDAR1A) and NMDAR2C ( pCMV-2 6-NotI - 2 4 ) subunits. 
In both transf ections, ~2 x 10 6 HEK cells were transiently 
transfected with 19 fig of the indicated plasmid(s) 
according to standard CaPO ; transfection procedures [Wigler 
et al. (1979) Proc . Natl. Acad. Sci . USA 76:1373-1376]. In 
addition, 1 y.q of plasmid pCMV/?gal (Clontech Laboratories, 
Palo Alto, CA) , which contains the Escherichia coli 
/?-galactosidase gene fused to the CMV promoter, were 
co-transf ected as a reporter gene for monitoring the 
efficiency of transfection. The transf ectants were 

analyzed for /?-galactosidase expression by direct staining 
of the product of a reaction involving /?-galactosidase and 
the X-gal substrate [Jones (1986) EMBO 5:3133-3142]. 
Transf ectants can also be analyzed for £-galactosidase 
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expression by measurement of £-galactosidase activity 
[Miller (1972) in Experiments in Molecular Genetics, 
pp. 352-355, Cold Spring Harbor Press]. 

The efficiency of these transf ections of HEK 
cells was typical of standard efficiencies (i.e., -50%). 

B. Stable Transfection of Mamm alian Cells 

Mammalian cells, such as HEK 293 cells, can be 

stably transfected using the calcium phosphate transfection 

procedure [Current Protocols in Molecular Biology, Vol. 1, 

Wiley Inter-Science, Supplement 14, Unit 9.1.1-9.1.9 

6 

(1990)]. Ten-cm plates, each containing 1-2 x 10 cells, 
are transfected with 10 ml of DNA/calcium phosphate 
precipitate in media containing approximately 19 Mg of NMDA 
receptor subunit-encoding DNA and 1 M9 of DNA encoding a 
selectable marker, for example, neomycin-resistance gene 
(i.e., pSV2neo) . After -14 days of growth in media 
containing typically 1 Mg/™1 G418, colonies form and are 
individually isolated using cloning cylinders. The 
isolates are then subjected to limiting dilution and 
screened to identify those that express NMDA receptors 
using, for example, methods described below. 



4 
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C. Analysis of Transf ectants 

1 . Northern Blot Hybridization Analysis 

Total RNA was isolated from -1 x 10 7 HEK cells 
co-transf ected with NMDAR1 and pCMV-2 6-NotI-2 4 , and 5-10 ^9 
5 of RNA was used - for northern hybridization analysis. 
Fragments from human neuronal NMDAR subunit-encoding 
plasmids were randomly primed and labeled with P-dCTP 
Klenow incorporation and used as probes. The northern blot 
hybridization and wash conditions were as follows: 

10 hybridization in 5x SSPE, 5X Denhart ' s 

solution, 50% formamide, at 42°C 
followed by washing in 0.2x SSPE, 
0.1% SDS, at 65°C. 

Results of these studies revealed the 
15 transf ectants expressed detectable levels of NMDAR1 and 
NMDAR2C mRNA of the appropriate size (based on the size of 
the cDNAs) . 

2 . Fluorescent indicator-based assays 

Activation of ligand-gated NMDA receptors by 
20 agonists leads to an influx of cations (both monovalent and 
divalent), including Ca 2+ , through the receptor channel. 
Calcium entry into the cell through the channel can in turn 
induce release of calcium contained in intracellular 
stores. Monovalent cation entry into the cell through the 
25 channel can also result in an increase in cytoplasmic 
calcium levels through depolarization of the membrane and 
subsequent activation of voltage-dependent calcium 
channels. Therefore, methods of detecting transient 
increases in intracellular calcium concentration can be 
30 applied to the analysis of functional NMDA receptor 
expression. One method for measuring intracellular calcium 
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levels relies on calcium-sensitive fluorescent 
indicators . 

Calcium-sensitive indicators, such as fluo-3 
(Catalog No. F-1241, Molecular Probes, Inc., Eugene, OR) 
5 are available as acetoxymethy 1 esters which are membrane 
permeable. When- the acetoxymethy 1 ester form of the 
indicator enters a cell, the ester group is removed by 
cytosolic esterases, thereby trapping the free indicator in 
the cytosol. Interaction of the free indicator with 

10 calcium results in increased fluorescence of the indicator; 
therefore, an increase in the intracellular Ca 2 * 
concentration of cells containing the indicator can be 
expressed directly as an increase in fluorescence. An 
automated fluorescence detection system for assaying NMDA 

15 receptors has been described in commonly assigned pending 
US Patent Application No. 07/812,254 and corresponding PCT 
Patent Application No. US92/11090, incorporated by 
reference herein in their entirety. 



Mammalian cells that have been transfected with 
20 DNA encoding NMDAR1 or NMDAR1 and NMDAR2 subunits can be 
analyzed for expression of functional recombinant NMDA 
receptors using the automated fluorescent indicator-based 
assay. The assay procedure is as follows. 



Untransf ected mammalian host cells (or host cells 
25 transiently transfected with pCMV-T7-2) and mammalian cells 
that have been transfected with NMDAR1 ± NMDAR2 subunit DNA 
are plated in the wells of a 96-well microtiter dish (Nunc 
Catalog No. 1-6708, available through Alameda Industries, 
Escondido, CA) that has been precoated with poly-L-lysine 
30 at a density of 2.5 x 10 5 cells/well and loaded with fluo-3 
by incubation for 2 hours at 20°C in a medium containing 20 
MM fluo-3, 0.2% Pluronic F-127 in HBS (125 mM NaCl, 5 mM 
KCl, 1.8 mM CaCl 2 , 0.62 mM MgCl 2 , 20 mM glucose, 20 mM 
HEPES, pH 7.4). The cells are then washed with assay 
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buffer (i.e. HBS) . The microtiter dish is then placed into 
a fluorescence plate reader (e.g., Fluoroskan II, Lab 
Products International, Ltd., Raleigh, NC) and the basal 
fluorescence of each well is measured and recorded before 
5 addition of 10 glycine and 10 jxM glutamate to the wells. 
The fluorescence of the wells is monitored repeatedly (75 
readings at 0.63-sec intervals) following addition of 
agonist . 

The fluorescence of the untransf ected host cells 
10 preferably will not change after addition of glycine and 
glutamate, i.e., the host cells should not express 
endogenous excitatory amino acid receptors. The 
fluorescence of mammalian cells transfected with NMDAR1 ± 
NMDAR2 subunit DNA will increase after addition of glycine 
15 and glutamate if a sufficient number of functional NMDA 
receptors are expressed at the cell surface, and 
fluorescence readings are taken rapidly. 

The resting potential of the membrane of some 
mammalian host cells may be relatively positive (e.g., -35 

20 mV) . Because activation of some NMDA receptors may be 
significantly reduced at relatively positive potentials, it 
may be necessary to lower the resting potential of the 
membrane of cells transfected with human NMDA receptor 
subunit-encoding DNAs prior to assaying the cells for NMDA 

25 receptor activity using the fluorescent indicator-based 
assay. This may be accomplished by adding valinomycin (-10 
/iM) to the transfected cells prior to adding NMDA receptor 
agonists to initiate the assay. 

3 . NMDA Receptor Ligand Binding Assays 



30 Mammalian cells transfected with NMDAR1 ± NMDAR2 

subunit DNAs can be analyzed for [ 3 H]-MK801 binding. An 
additional ligand-binding assay for NMDA receptors using 
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3 H-CGP39653 is also described below. Rat brain membranes 
are included in the binding assays as a positive control. 



a . Preparation of Membranes 

i „ Buffv coat Homoqenate from Rat 
5 Cerebral Cortex 

Buffy coat membranes are prepared from rat brain 
cortices as described by Jones et al . [(1989) J. Pharmacol. 
Meth. 22:161]. Briefly, cortices from ten freshly thawed 
frozen rat brains are dissected and weighed. The tissue is 

10 homogenized in 20 volumes of 0.32 M ice-cold sucrose in a 
glass homogenizing tube using a Teflon pestle. The 
suspension is centrifuged at 1,000 x g for 10 minutes at 
4°C. The supernatant is decanted and centrifuged at 20,000 
x g for 20 minutes at 4°C. The pellet is resuspended in 20 

15 volumes of ice-cold distilled water with a Polytron for 30 
sec at setting 6. The suspension is centrifuged at 3,000 
x g for 20 minutes at 4°C. The buffy coat pellet is rinsed 
gently with supernatant and then recentrif uged at 48,000 x 
g for 20 minutes at 4°C. The pellet is resuspended in 20 

20 volumes of ice-cold distilled water with a Polytron and 
centrifuged again at 48,000 x g for 20 minutes. The wash 
step is repeated once more. The final suspension is 
divided into aliquots, centrifuged. Each pellet can be 
stored frozen at -20°C for 12 hrs or more before use. 



25 ii. Membranes from Transfected and 

Untransf ected Mammalian Cells 

In order to prepare membranes from transfected 
and untransf ected mammalian cells, the cells are scraped 
from the tissue culture plates, and the plates are rinsed 
30 with 5 ml of PBS (phosphate-buffered saline: 137 mM NaCl, 
2.7 mM KC1, 10 mM Na 2 HP0 4 , 1.7 mM KH 2 P0J . The cells are 
centrifuged at low speed in a table-top centrifuge, and the 
cell pellet is rinsed with PBS. The cell pellet is 
resuspended in 20 ml of 10 mM Hepes buffer, pH 7.4, using 
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a Polytron at setting 3-6 for 30 seconds. The cell 
suspension is centrifuged at 48,000 x g for 20 minutes at 
4°C. The supernatant is discarded, and the pellet is kept 
frozen for 12 hrs or more at -20°C. 

5 b. (" 3 H1-MK801 Binding to NMDA Receptors 

The binding of [ 3 H]-MK801 to NMDA receptors is 
carried out as described by Wong et al . [(1986) Proc . Natl. 
Acad. Sci. USA 83:7104], with a few minor changes. Thus, 
on the day of the assay, the rat brain and mammalian cell 

10 (transfected and untransf ected) membrane pellets are 
resuspended in 50 volumes of 10 mM Hepes buffer, pH 7.4, 
using a 10-ml syringe and a 21-gauge needle, and incubated 
for 20 minutes at 37 °C. The supernatant is centrifuged at 
48, 000 x g for 20 minutes at 4°C. The pellet is 

15 resuspended in 2 ml of 10 mM Hepes, pH 7 . 4 and centrifuged 
as described above. The wash step is repeated once more, 
and the pellet is resuspended in 10 ml of 10 mM Hepes, pH 
7.4. The protein concentration is determined using the 
Biorad Bradford reagent. The pellet is finally resuspended 

20 in the assay buffer (10 mM Hepes, pH 7 . 4 ) at 1 mg/ml. 



For binding studies, the membrane suspension is 
incubated in duplicate with 2.5 nM [ 3 H]-MK801 (New England 
Nuclear, Boston, MA) in a total volume of 0.5 ml assay 
buffer (10 mM Hepes, pH 7 . 4 ) in the presence and absence of 

25 10 fiK glutamate and 10 juM glycine for 60 or 120 min at 
23 °C. Bound radioactivity is separated from free 

radioactivity by rapid filtration through Whatman GF/C 
filters which are presoaked for 2-3 hrs in 0.05% 
polyethylenimine . The filters are washed twice with 3 ml 

30 ice-cold assay buffer. The filters are dried and 

transferred to scintillation vials, each containing 10 ml 
of scintillation fluid. The vials are vortexed, and the 
radioactivity is measured in a Beckman scintillation 
counter. The nonspecific binding observed in the presence 
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of 10 /iM MK801 is subtracted from the total binding in 
order to determine the specific binding. 

Rat brain cortical buffy coat membranes displayed 
specific saturable binding of [ 3 H]-MK801. In the presence 
5 of glycine and glutamate, the ratio of total-to-nonspecific 
binding (S:N ratio)- was 28:1, whereas in the absence of 
glutamate and glycine the S:N ratio was 5:1. Thus, the 
binding of MK801 to rat NMDA receptors is potentiated by 
glutamatergic agonists. Scatchard analysis of [ H]-MK801 
10 binding to rat brain membranes indicated that the 
sensitivity of the assay was 90 fmoles of receptor. 

c. f 3 H1-CGP39653 Binding to NMDA Receptors 

The binding of [ 3 H] -CGP39653 to rat brain 
membranes is carried out as described by Sills et al . 

15 [(1991) Eur. J. Pharmacol. 292:19]. The buffy coat 
membrane pellet is resuspended in 50 volumes of 5 mM Tris- 
HC1 containing 10 mM EDTA, pH 7.7, and incubated for 10 
min. at 37°C. The supernatant is centrifuged at 48,000 x 
g for 10 min. at 4°C. The wash step is repeated once and 

20 the pellet is resuspended in 10 ml of 5 mM Tris-HCl 
containing 10 mM EDTA, pH 7.7. This rat brain membrane 
suspension is incubated in duplicate or triplicate with 2.0 
nM [ 3 H] -CGP39653 (New England Nuclear) in a total volume of 
0.5 ml assay buffer (5 mM Tris-HCl, pH 7.7) for 60 min at 

25 0°C. Nonspecific binding is determined in the presence of 
100 }J.M glutamate. Bound radioactivity is separated from 
the free by vacuum filtration through GF/C filters which 
are presoaked for 2-3 hrs in 0.05% polyethy lenimine , using 
the filtration manifold. Unbound radioactivity is removed 

30 with two washes of 3 ml each of ice-cold buffer. The 
filters are dried and transferred to scintillation vials, 
each containing 10 ml of scintillation fluid. The vials are 
vortexed, and the radioactivity is measured in a Beckman 
scintillation counter. The nonspecific binding observed in 
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the presence of 100 fj.M glutamate is subtracted from the 
total binding to determine the specific binding. 

[ 3 H] -CGP39653 binding was first measured as a 
function of membrane concentration. Specific binding 
increased linearly with increasing membrane concentration 
up to 200 fig of -protein in the presence of 2 nM [ H]- 
CGP39653 . 

Saturation analysis of [ 3 H] -CGP39653 binding was 
carried out by incubating 150 M9 of rat buffy coat 
homogenate with increasing concentrations of [ H]-CGP39653 
for 60 min~ at 4°C. Scatchard analysis indicated a single 
class of binding sites with a value of 0.69 ± 0.09 

pmoles/mg and a K d value of 12.3 ± 0.12 nM . 

While the invention has been described in detail 
with reference to certain preferred embodiments thereof, it 
will be understood that modifications and variations are 
within the spirit and scope of that which is described and 
claimed . 
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Summary of Sequences 

Sequence ID No. 1 is a nucleotide sequence 
encoding a human N-methyl-D-aspartate (NMDA) receptor 
subunit, NMDAR1A , and the deduced amino acid sequence 
5 thereof. 

Sequence ID No. 1A is a 3083 nucleotide sequence 
encoded by clone NMDA10, comprising nucleotides 320 - 3402 
of Sequence ID No. 1. Thus, Sequence ID No. 1A differs 
from Sequence ID No. 1 in that it does not contain the 319 
10 5' nucleotides, nor the 896 3' nucleotides thereof. 

Sequence ID No. IB is a 3155 nucleotide sequence 
encoded by clone NMDA11 , comprising nucleotides 1 - 2961, 
plus nucleotides 3325 - 3518 of Sequence ID No. 1. Thus, 
Sequence ID No. IB differs from Sequence ID No. 1 by the 
15 deletion of 363 nucleotides from the 3' portion thereof 
(i.e., by the deletion of nucleotides 2962 - 3324 of 
Sequence ID No. 1) , and further by the lack of the 781 
terminal 3' nucleotides of Sequence ID No. 1. 

Sequence ID No. 1C is a 2542 nucleotide sequence 
20 encoded by clone NMDA7 , comprising nucleotides 556 - 831 of 
Sequence ID No. 1, plus an additional 63 nucleotides (set 
forth in Sequence ID No. 3) and nucleotides 832 - 984, 1189 
- 2961 and 3325 - 3599 of Sequence ID No. 1. Thus, 
Sequence ID No. 1C differs from Sequence ID No. 1 in that 
25 it does not contain the 555 5 1 -most nucleotides thereof, it 
does not contain the 204 nucleotides set forth as 
nucleotides 985 - 1188 of Sequence ID No. 1, it does not 
contain the 363 3 1 nucleotides set forth as nucleotides 
2962 - 3324 of Sequence ID No. 1, and it does not contain 
30 the 700 3 1 -most nucleotides of Sequence ID No. 1, while it 
does contain an additional 63 nucleotides (Sequence ID No. 
3) inserted between nucleotides 831 and 832 of Sequence ID 
No. 1. 
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Sequence ID No. ID is a 593 nucleotide sequence 
encoded by clone NMDA3 , comprising nucleotides 2617 - 2961, 
plus nucleotides 4049 - 4298 of Sequence ID No. 1. Thus, 
Sequence ID No. ID differs from Sequence ID No. 1 in that 
it does not contain the 2616 5' nucleotides thereof, and by 
the deletion of 1087 nucleotides from the 3' portion 
thereof (i.e., by the deletion of nucleotides 2962 - 4048 
of Sequence ID No. 1) . 

Sequence ID No. IE is a nucleotide sequence 
encoding human NMDA receptor subunit NMDAR1-A363, 
comprising nucleotides 1 - 2961, plus nucleotides 3325 - 
4298 of Sequence ID No. 1. Thus, Sequence ID No. IE 
differs from Sequence ID No. 1 in that it does not contain 
the 363 nucleotides set forth as nucleotides 2962 - 3324 of 
Sequence ID No. 1. 

Sequence ID No. IF is a nucleotide sequence 
encoding human NMDA receptor subunit NMDAR1-A1087 , 
comprising nucleotides 1 - 2961, plus nucleotides 4049 - 
4298 of Sequence ID No. 1. Thus, Sequence ID No. IF 
differs from Sequence ID No. 1 in that it does not contain 
the 1087 nucleotides set forth as nucleotides 2962 - 4048 
of Sequence ID No. 1. 

Sequence ID No. 1G is a nucleotide sequence 
encoding human NMDA receptor subunit NMDAR1-I63. Sequence 
ID No. 1G is the same as Sequence ID No. 1, further 
comprising an additional 63 nucleotides (set forth in 
Sequence ID No. 3) inserted between nucleotides 831 and 832 
of Sequence ID No . 1 . 

Sequence ID No. 1H is a nucleotide sequence 
encoding human NMDA receptor subunit NMDAR1-I63-A204 . 
Sequence ID No. 1H is the same as Sequence ID No. 1G, 
except Sequence ID No. 1H does not contain the 204 
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nucleotides set forth as nucleotides 985 - 1188 of Sequence 
ID No. 1. 

Sequence ID No. II is a nucleotide sequence 
encoding human NMDA receptor subunit NMDAR1-I63-A204-A363 . 
5 Sequence ID No. II is the same as Sequence ID No. 1H, 
except Sequence ID No. II does not contain the 363 
nucleotides set forth as nucleotides 2962 - 3324 of 
Sequence ID No. 1. 

Sequence ID No. 1J is a nucleotide sequence 
10 encoding human NMDA receptor subunit NMDAR1-A204 . Sequence 
ID No. 1J is the same as Sequence ID No. 1, except Sequence 
ID No. 1J does not contain the 204 nucleotides set forth as 
nucleotides 985 - 1188 of Sequence ID No. 1. 

Sequence ID No. IK is a nucleotide sequence 
15 encoding human NMDA receptor subunit NMDAR1-A2 04-A3 63 . 
Sequence ID No. IK differs from Sequence ID No. 1 in that 
Sequence ID No. IK does not contain the 204 nucleotides set 
forth as nucleotides 985 - 1188 of Sequence ID No. 1, nor 
the 363 nucleotides set forth as nucleotides 2962 - 3324 of 
20 Sequence ID No. 1. 

Sequence ID No. 1L is a nucleotide sequence 
encoding human NMDA receptor subunit NMDAR1-A204-A1087 . 
Sequence ID No. 1L differs from Sequence ID No. 1 in that 
Sequence ID No. 1L does not contain the 204 nucleotides set 
25 forth as nucleotides 985 - 1188 of Sequence ID No. 1, nor 
the 1087 nucleotides set forth as nucleotides 2962 - 4048 
of Sequence ID No. 1. 

Sequence ID No. 1M is a nucleotide sequence 
encoding human NMDA receptor subunit NMDAR1-I63-A3 63 . 
30 Sequence ID No. 1M is the same as Sequence ID No. 1G except 
Sequence ID No. 1M does not contain the 363 nucleotides set 
forth as nucleotides 2962 - 3324 of Sequence ID No. 1. 



83 

Sequence ID No. IN is a nucleotide sequence 
encoding human NMDA receptor subunit NMDAR1-I 63 -A1087 . 
Sequence No. IN is the same as Sequence ID No. 1G except 
Sequence ID No. IN does not contain the 1087 nucleotides 
5 set forth as nucleotides 2962 - 4048 of Sequence ID No. 1. 

Sequence ID No. IP is a nucleotide sequence 
encoding human NMDA receptor subunit NMDAR1-I63-A204-A1087 . 
Sequence ID No. IP is the same as Sequence ID No. 1H, 
except Sequence ID No. IP does not contain the 1087 
10 nucleotides set forth as nucleotides 2962 - 4048 of 
Sequence ID No. 1. 

Sequence ID No. 2 is the amino acid sequence of 
the NMDA receptor subunit set forth in Sequence ID No. 1. 

Sequence ID No. 2A is the amino acid sequence of 
15 a portion of an NMDA receptor subunit as encoded by the 
nucleotide sequence of Sequence ID No. 1A. 

Sequence ID No. 2B is the amino acid sequence of 
an NMDA receptor subunit encoded by the nucleotide sequence 
of Sequence ID No. IB. 

20 Sequence ID No. 2C is the amino acid sequence of 

a portion of an NMDA receptor subunit as encoded by the 
nucleotide sequence of Sequence ID No. 1C. 

Sequence ID No. 2D is the amino acid sequence of 
a portion of an NMDA receptor subunit as encoded by the 
25 nucleotide sequence of Sequence. ID No. ID. 



Sequence ID No. 2E is the amino acid sequence of 
an NMDA receptor subunit encoded by the nucleotide sequence 
of Sequence ID No. IE. 
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Sequence ID No. 2F is the amino acid sequence of 
an NMDA receptor subunit encoded by the nucleotide sequence 
of Sequence ID No. IF. 

Sequence ID No. 2G is the amino acid sequence of 
5 an NMDA receptor subunit encoded by the nucleotide sequence 
of Sequence ID No. 1G. 

Sequence ID No. 2H is the amino acid sequence of 
an NMDA receptor subunit encoded by the nucleotide sequence 
of Sequence ID No. 1H. 

10 Sequence ID No. 21 is the amino acid sequence of 

an NMDA receptor subunit encoded by the nucleotide sequence 
of Sequence ID No. II. 

Sequence ID No. 2 J is the amino acid sequence of 
an NMDA receptor subunit encoded by the nucleotide sequence 
15 of Sequence ID No. 1J. 

Sequence ID No. 2K is the amino acid sequence of 
an NMDA receptor subunit encoded by the nucleotide sequence 
of Sequence ID No. IK. 

Sequence ID No. 2L is the amino acid sequence of 
20 an NMDA receptor subunit encoded by the nucleotide sequence 
of Sequence ID No. 1L. 

Sequence ID No. 2M is the amino acid sequence of 
an NMDA receptor subunit encoded by the nucleotide sequence 
of Sequence ID No. 1M. 



25 



Sequence ID No. 2N is the amino acid sequence of 
an NMDA receptor subunit encoded by the nucleotide sequence 
of Sequence ID No. IN T 
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Sequence ID No. 2P is the amino acid sequence of 
an NMDA receptor subunit encoded by the nucleotide sequence 
of Sequence ID No. IP. 



Sequence ID No. 3 is a nucleotide sequence 
5 encoding the 63 nucleotide insert present in Sequence ID 
Nos. 1C, 1G, 1H, II, 1M, IN and IP. 

Sequence ID No. 4 is the 21 amino acid sequence 
encoded by the insert set forth in Sequence ID No. 3. 

Sequence ID No. 5 is a nucleotide sequence of a 
10 clone ( P CMV-2 6-NotI-24) encoding a human ~ N-methyl- 
D-aspartate (NMDA) receptor subunit, NMDAR2C, and the 
deduced amino acid sequence thereof. 

Sequence ID No. 5A is a 2026 nucleotide sequence 
encoded by clone NMDA21, comprising nucleotides 931 - 2350, 
15 and 2402 - 3307 of Sequence ID No. 5. Thus, Sequence ID 
No. 5A differs from Sequence ID No. 5 in that it does not 
contain the 930 5' nucleotides thereof, nor the 51 
nucleotides located at position 2351 - 2401 of Sequence ID 
No. 5, nor the 1061 3' nucleotides of Sequence ID No. 5. 

20 Sequence ID No. 5B is a 3698 nucleotide sequence 

encoded by clone NMDA22, comprising nucleotides 367 - 1300 
of Sequence ID No. 5, plus an additional 11 nucleotides 
(set forth as Sequence ID No. 9), and nucleotides 1301 - 
1959 and 1975 - 4068 of Sequence ID No. 5. Thus, Sequence 
25 ID No. 5B differs from Sequence ID No. 5 by the lack of the 
366 5 '-most nucleotides, by the insertion of 11 nucleotides 
between nucleotides 1300 and 1301 of Sequence ID No. 5, and 
further by the lack of the 15 nucleotides of Sequence ID 
No. 5 from residue 1960 to residue 1974. 
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Sequence ID No. 5C is a 3243 nucleotide sequence 
encoded by clone NMDA24, comprising nucleotides 861 - 1300 
of Sequence ID No. 5, plus an additional 11 nucleotides 
(Sequence ID No. 9), nucleotides 1301 - 2350 of Sequence ID 
No. 5, an additional 24 nucleotides (set forth as Sequence 
ID No. 7) and nucleotides 2351 - 4068 of Sequence ID No. 5. 
Thus, Sequence ID No. 5C differs from Sequence ID No. 5 in 
that it does not contain the 860 5 ' -most nucleotides 
thereof, while it does contain an additional 11 nucleotides 
(Sequence ID No. 9) inserted between nucleotides 1300 and 
1301, plus an additional 24 nucleotides (Sequence ID No. 7) 
inserted between nucleotides 2350 and 2351 of Sequence ID 
No. 5. 



Sequence ID No. 5D is a 3025 nucleotide sequence 
encoded by clone NMDA26, comprising nucleotides 1 - 3025 of 
Sequence ID No. 5. Thus, Sequence ID No. 5D differs from 
Sequence ID No. 5 in that it does not contain the 1043 
3' -terminal nucleotides thereof. 



Sequence ID No. 5E is a nucleotide sequence 
20 encoding human NMDA receptor subunit pCMV-2 6-ScaI-24 , which 
differs from Sequence ID No. 5 only in the insertion of 24 
nucleotides (Sequence ID No. 7) between nucleotides 2350 
and 2351 of Sequence ID No. 5. 

Sequence ID No. 5F is a nucleotide sequence 
25 encoding human NMDA receptor subunit pCMV-2 6-ScaI-22 , which 
differs from Sequence ID No. 5 only in the deletion of 
nucleotides 1960 - 1974 of Sequence ID No. 5. 

Sequence ID No. 5G is a nucleotide sequence 
encoding human NMDA receptor subunit pCMV- 26 -Seal - 21-JVocI- 24 , 
30 which differs from Sequence ID No. 5 only in the deletion 
of nucleotides 2351 - 2401 of Sequence ID No. 5. 
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Sequence ID No. 5H is a nucleotide sequence 
encoding human NMD A receptor subunit NMDAR2C-A15-I24 . 
Sequence ID No. 5H is the same as Sequence ID No. 5F, 
except Sequence ID No. 5H further contains the 24 
nucleotide insert set forth in Sequence ID No. 7, 
positioned between nucleotides 2350 and 2351 of Sequence ID 
No. 5. 

Sequence ID No. 51 is a nucleotide sequence 
encoding human NMDA receptor subunit NMDAR2C-A15-A51 . 
Sequence ID No. 51 is the same as Sequence ID No. 5G, 
except Sequence ID No. 51 does not contain the 15 
nucleotides' set forth as nucleotides 1960 - 1974 of 
Sequence ID No. 5. 

Sequence ID No. 6 is the amino acid sequence of 
15 the NMDA receptor subunit set forth in Sequence ID No. 5. 

Sequence ID No. 6A is the amino acid sequence of 
a portion of an NMDA receptor subunit as encoded by the 
nucleotide sequence of Sequence ID No. 5A. 



10 



20 



Sequence ID No. 6B is the amino acid sequence of 
a portion of an NMDA receptor subunit as encoded by the 
nucleotide sequence of Sequence ID No. 5B. 

Sequence ID No. 6C is the amino acid sequence of 
a portion of an NMDA receptor subunit as encoded by the 
nucleotide sequence of Sequence ID No. 5C. 

Sequence ID No. 6D is the amino acid sequence of 
a portion of an NMDA receptor subunit as encoded by the 
nucleotide sequence of Sequence ID No. 5D. 

Sequence ID No. 6E is the amino acid sequence of 
an NMDA receptor subunit encoded by the nucleotide sequence 
30 of Sequence ID No. 5E. 



25 
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Sequence ID No. h i s *. h _ aw . 

an NMDA receptor subunit „ Se " Uen « of 
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Plus the initiation T^ 1 *^ s ^ence of clone NMDA27, 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION : 

(i) APPLICANT: Daggett, Lorrie P. 

Ellis, Steven B. 
Liaw, Chen W. 
Lu, Chin-Chun 

(ii) TITLE OF INVENTION: HUMAN N-METHY L-D-AS PARTATE RECEPTOR 
SUBUNITS, DNA ENCODING SAME AND USES THEREFOR 

(iii) NUMBER OF SEQUENCES: 21 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Pretty, Schroeder, Brueggemann & Clark 

(B) STREET: 444 South Flower Street, Suite 2000 

(C) CITY: Los Angeles 

(D) STATE: CA 

(E) COUNTRY: U.S.A. 

(F) ZIP:~-90071-2921 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 20-APR-1994 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/052,449 

(B) FILING DATE: 20-APR-1993 

(viii) ATTORNEY /AGENT INFORMATION : 

(A) NAME: Reiter, Stephen E. 

(B) REGISTRATION NUMBER: 31,192 

(C) REFERENCE /DOCKET NUMBER: P41 9424 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 619-54 6-473 7 

(B) TELEFAX: 619-546-9392 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4298 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 262.. 3078 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CAAGCCGGGC GTTCGGAGCT GTGCCCGGCC CCGCTTCAGC ACCGCGGACA GCGCCGGCCG 

CGTGGGGCTG AGCGCCGAGC CCCCGCGCAC GCTTCAGCCC CCCTTCCCTC GGCCGACGTC 

CCGGGACCGC CGCTCCGGGG GAGACGTGGC GTCCGCAGCC CGCGGGGCCG GGCGAGCGCA 

GGACGGCCCG GAAGCCCCGC GGGGGATGCG CCGAGGGCCC CGCGTTCGCG CCGCGCAGAG 

CCAGGCCCGC GGCCCGAGCC C ATG AGC ACC ATG CGC CTG CTG ACG CTC GCC 

Met Ser Thr Met Arg Leu Leu Thr Leu Ala 
1 5 10 

CTG CTG TTC TCC TGC TCC GTC GCC CGT GCC GCG TGC GAG CCC AAG ATC 
Leu Leu Phe Ser Cys Ser Val Ala Arg Ala Ala C y3 Asp Pro Lys 111 
15 20 25 

vll rT T o? C °? G GTG CTG AGC ACG CGG AAG CAC GAG CAG ATG TTC 

Val Asn Xle Gly Ala Val Leu Ser Thr Arg Lys His Glu Gin Met Phe 

30 _ 3 5 40 

CGC GAG GCC GTG AAC CAG GCC AAC AAG CGG CAC GGC TCC TGG AAG ATT 
Arg Glu Ala Val Asn Gin Ala Asn Lys Arg His Gly Ser Trp Lys 111 
45 50 55 

Gin lIu ^ GCC TCC GTC ACG CAC AAG CCC ** c GCC ATC CAG ATG 

Gin Leu Asn Ala Thr Ser Val Thr His Lys Pro Asn Ala He Gin Met 

60 65 70 

til Leu Ser vl? ^ ^ GAG ° TC ATC TCC AGC CAG G ™ TAG GCC ATC 
Ala Leu Ser Val Cys Glu Asp Leu He Ser Ser Gin Val Tyr Ala He 

80 85 go 

Hu Sal Ser CCC AAC GAC CAC TTC ACT CCC ACC CCT 

Leu Val Ser Hi.s Pro Pro Thr Pro Asn Asp His Phe Thr Pro Thr Pro 

95 100 105 

vll ^ A GCC GGC TTC TAC CGC ATA CCC OTG CTG GGG CTG ACC 

Val Ser Tyr Thr Ala Gly Phe Tyr Arg He Pro Val Leu Gly Leu Thr 

115 12 o 

?hr Jro Jer Ser l*° ° AC AAG AGG ATC CAC CTG AGC TTC CTG 

Thr Arg Met Ser He Tyr Ser Asp Lys Ser He His- Leu Ser Phe Leu 

130 135 

ArS ?£r 51? Pro T A ° l CC CAC CAG TCC AGC GTG TGG TT * OAG ATG 

Arg Thr Val Pro Pro Tyr Ser H iS Gin Ser Ser Val Trp Phe Glu Met 

145 150 

Set So SIf ? A ° AGC l GG AAC CAC ATC ATC CTG CTG GTC AGC GAC GAC 
Met Arg Val Tyr Ser Trp Asn His He He Leu Leu Val Ser Asp Asp 

160 165 * 17 5 

H?« ^ G CGG GCG GCT CAG *** CGC CTG GAG ACG CTG CTG GAG GAG 

His Glu Gly Arg Ala Ala Gin Lys Arg Leu Glu Thr Leu Leu Glu G^u 
175 180 18S 

Ara g5!u ITr- ^ GCA GAG ^ GTG CTG CAG TTT GAC CCA 0GG ACC AAG 
Arg Glu Ser Lys Ala Glu Lys Val Leu Gin Phe Asp Pro Gly Thr Lys 

190 196 200 

A^n vll A! 0 r° TG T CTG ATG GAG GCG *** GAG CTG OAG GCC CGG GTC 

Asn Val Thr Ala Leu Leu Met Glu Ala Lys Glu Leu Glu Ala Arg Val 

210 215 



60 
120 
180 
240 
291 

339 

387 

435 

483 

531 

579 

627 

675 



723 



771 



819 



867 



915 
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ATC ATC CTT TCT GCC AGC GAG GAC GAT GCT GCC ACT GTA TAC CGC GCA 963 

He He Leu Ser Ala Ser Glu Asd Asp Ala Ala Thr Val Tyr Arg Ala 

220 225 230 

GCC GCG ATG CTG AAC ATG ACG GGC TCC GGG TAC GTG TGG CTG GTC GGC 1011 
Ala Ala Met Leu Asn Met Thr Gly Ser Gly Tyr Val Trp Leu Val Gly 
235 240 ' 245 250 

GAG CGC GAG ATC TCG GGG AAC GCC CTG CGC TAC GCC CCA GAC GGC ATC 1059 
Glu Arg Glu He Ser Gly Asn Ala Leu Arg Tyr Ala Pro Asp Gly lie 
255 260 265 

CTC GGG CTG CAG CTC ATC AAC GGC AAG AAC GAG TCG GCC CAC ATC AGC 1107 
Leu Gly Leu Gin Leu He Asn Gly Lys Asn Glu Ser Ala His He Ser 
270 275 280 

GAC GCC GTG GGC GTG GTG GCC CAG GCC GTG CAC GAG CTC CTC GAG AAG 1155 
Asp Ala Val Gly Val Val Ala Gin Ala Val His Glu Leu Leu Glu Lys 
285 290 295 

GAG AAC ATC ACC GAC CCG CCG CGG GGC TGC GTG GGC AAC ACC AAC ATC 1203 
Glu Asn He Thr A~6p Pro Pro Arg Gly Cys Val Gly Asn Thr Asn He 
300 305 310 

TGG AAG ACC GGG CCG CTC TTC AAG AGA GTG CTG ATG TCT TCC AAG TAT 12 51 

Trp Lys Thr Gly Pro Leu Phe Lys Arg Val Leu Met Ser Ser Lys Tyr 
315 320 325 330 

GCG GAT GGG GTG ACT GGT CGC GTG GAG TTC AAT GAG GAT GGG GAC CGG 1299 
Ala Asp Gly Val Thr Gly Arg Val Glu Phe Asn Glu Asp Gly Asp Arg 
335 340 345 

AAG TTC GCC AAC TAC AGC ATC ATG AAC CTG CAG AAC CGC AAG CTG GTG 1347 
Lys Phe Ala Asn Tyr Ser He Met Asn Leu Gin Asn Arg Lys Leu Val 
350 355 360 

CAA GTG GGC ATC TAC AAT GGC ACC CAC GTC ATC CCT AAT GAC AGG AAG 1395 
Gin Val Gly He Tyr Asn Gly Thr His Val He Pro Asn Asp Arg Lys 
365 370 375 

ATC ATC TGG CCA GGC GGA GAG ACA GAG AAG CCT CGA GGG TAC CAG ATG 1443 
He He Trp Pro Gly Gly Glu Thr Glu Lys Pro Arg Gly Tyr Gin Met 
380 385 390 

TCC ACC AGA CTG AAG ATT GTG ACG ATC CAC CAG GAG CCC TTC GTG TAC 1491 
Ser Thr Arg Leu Lys He Val Thr He His Gin Glu Pro Phe Val Tyr 
395 400 405 410 

GTC AAG CCC ACG CTG AGT GAT GGG ACA TGC AAG GAG GAG TTC ACA GTC 1539 
Val Lys Pro Thr Leu Ser Asp Gly Thr Cys Lys Glu Glu Phe Thr Val 
415 420 425 

AAC GGC GAC CCA GTC AAG AAG GTG ATC TGC ACC GGG CCC AAC GAC ACG 1587 
Asn Gly Asp Pro Val Lys Lys Val He Cys Thr Gly Pro Asn Asp Thr 
430 435 440 

TCG CCG GGC AGC CCC CGC CAC ACG GTG CCT CAG TGT TGC TAC GGC TTT 1635 
Ser Pro Gly Ser Pro Arg His Thr Val Pro Gin Cys Cys Tyr Gly Phe 
445 450 455 

TGC ATC GAC CTG CTC ATC AAG CTG GCA CGG ACC ATG AAC TTC ACC TAC 1683 
Cys He Asp Leu Leu He Lys Leu Ala Arg Thr Met Asn Phe Thr Tyr 
460 465 470 

GAG GTG CAC CTG GTG GCA GAT GGC AAG TTC GGC ACA CAG GAG CGG GTG 1731 
Glu Val His Leu Val Ala Asp Gly Lys Phe Gly Thr Gin Glu Arg Val 
475 480 485 490 
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AAC AAC AGC AAC AAG AAG GAG TGG AAT GGG ATG ATG GGC GAG CTG CTC 

Asn Asn Ser Asn Lys Lys Glu Trp Asn Gly Met Met Gly Glu Leu LeS 
495 500 505 

Ser Gly Gin tla l*° £™ ?? C GTC GCG CCG CTA ACC AT * AAC GAG 

Ser Gly Gin Ala Asp Met He Val Ala Pro Leu Thr He Asn Asn Glu 

510 515 5 20 

Arg Ala Gin JJr ?S ^ TCC CCC TTC AAG TAC CAG 6CC CTG 

Arg Ala Gin Tyr He Glu Phe Ser Lys Pro Phe Lys Tyr Gin Gly Leu 

ACT ATT CTG CTC AAG AAG GAG ATT CCC CGG AGC ACG CTG GAC TCG TTC 
Thr lie Leu Val Lys Lys Glu He Pro Arg Ser Thr Leu Asp Ser He 

545 5 50 

ie° Gin p°o Ph C ACA CTG TGG CTG CTG GTG GGG "G TCG GTG 

Met Gin Pro Phe Gin Ser Thr Leu Trp Leu Leu Val Gly Leu Ser Val 

560 565 ' 570 

H AC ^ G ? C GTG ATG CTG TAC CTG CTG GAC CGC TTC AGC CCC TTC 

Hra Val Val Ala Vat Met Leu Tyr Leu Leu Asp Arg Phe Ser Pro lie 

575 580 585 

Gly Ara He ^ £? ™° ™° ^° GAG GAG G * G GAG CAC GCA CTG ACC 
Gly Arg Phe Lys Val Asn Ser Glu Glu Glu Glu Glu Asp Ala Leu Thr 
590 595 600 

CTG TCC TCG GCC ATG TGG TTC TCC TGG GGC GTC CTG CTC AAC TCC GGC 
Leu ser Ser Ala Met Trp Phe Ser Trp Gly Val Leu Seu itn Ser Gly 
bU5 610 61S 

tT° G ? C GCC CCC AGA AGC TTC TCA GCG CGC ATC CTG GGC ATG 

lie Gly Glu Gly Ala Pro Arg Ser Phe Ser Ala Arg He Su GiJ 

625 630 

GTG TGG GCC GGC TTT GCC ATG ATC ATC GTG GCC TCC TAC ACC GCC AAC 
Val Trp Ala Gly Phe Ala Met He He Val Ala Ser lyr J£ S£ ^ 

640 645 650 

2S Ala Ala pJe S° v*? J** GA ° ° GG CCG GAG GAG GGG ATC ACG GGC 
Leu Ala Ala Phe Leu Val Leu Asp Arg Pro Glu Glu Arg He Thr Gly 

655 660 ' 665 

H° A^n Asp Pro Aro f™ J™ AA< " CCC TCG GAC «T ATC TAC GCC 

lie Asn Asp Pro Arg Leu Arg Asn Pro Ser Asp Lys Phe He Tyr Ala 

675 680 

ACG GTG AAG CAG AGC TCC GTG GAT ATC TAC TTC CGG CGC CAG GTG gap 
Thr Val Lys Gin Ser Ser Val Asp He Tyr Phe Arg Arg cin vll gJu 

2! Ser ?hr Mel 1*° ?° ^ ATG GAG AAG CAC ^ C ™ G GAG ACT GCG 
Leu Ser Thr Met Tyr Arg fUs Met Glu Lys His Asn Tyr Glu Ser Ala 

705 710 

S SE S 55 SS K SI? S - « S g» - ™ ^ 
5S 2S S S SI! S X! JS SK S S SK 2£ SS 2S 5 
S5 J5 ?£ ^ SIS S S S 2S S SS S S °5 »? 

755 760 



1779 



1827 



1875 



1923 



1971 



2019 



2067 



2115 



2163 



2211 



2259 



2307 



2355 



2403 



2451 



2499 



2547 
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CGC AAA GAC AGC CCC TGG AAG CAG AAC GTC TCC CTG TCC ATC CTC AAG 2595 

Arg Lys Asp Ser Pro Trp Lys Gin Asn Val Ser Leu Ser lie Leu Lys 
765 770 775 

TCC CAC GAG AAT GGC TTC ATG GAA GAC CTG GAC AAG ACG TGG GTT CGG 2643 
Ser His Glu Asn Gly Phe Met Glu Asp Leu Asp Lys Thr Trp Val Arg 
780 785 790 

TAT CAG GAA TGT GAC TCG CGC AGC AAC GCC CCT GCG ACC CTT ACT TTT 2 691 

Tyr Gin Glu Cys Asp Ser Arg Ser Asn Ala Pro Ala Thr Leu Thr Phe 
795 800 805 810 

GAG AAC ATG GCC GGG GTC TTC ATG CTG GTA GCT GGG GGC ATC GTG GCC 2739 
Glu Asn Met Ala Gly Val Phe Met Leu Val Ala Gly Gly lie Val Ala 
815 820 " 825 

GGG ATC TTC CTG ATT TTC ATC GAG ATT GCC TAC AAG CGG CAC AAG GAT 2787 
Gly lie Phe Leu lie Phe lie Glu lie Ala Tyr Lys Arg His Lys Asp 
830 83 5 " ' 840 

GCT CGC CGG AAG CAG ATG CAG CTG GCC TTT GCC GCC GTT AAC GTG TGG 2 83 5 

Ala Arg Arg Lys Gin Met Gin Leu Ala Phe Ala Ala Val Asn Val Trp 
845 850 855 

CGG AAG AAC CTG CAG GAT AGA AAG AGT GGT AGA GCA GAG CCT GAC CCT 2883 
Arg Lys Asn Leu Gin Asp Arg Lys Ser Gly Arg Ala Glu Pro Asp Pro 
860 ~ 865 ~ 870 

AAA AAG AAA GCC ACA TTT AGG GCT ATC ACC TCC ACC CTG GCT TCC AGC 2931 
Lys Lys Lys Ala Thr Phe Arg Ala lie Thr Ser Thr Leu Ala Ser Ser 
875 880 885 890 

TTC AAG AGG CGT AGG TCC TCC AAA GAC ACG AGC ACC GGG GGT GGA CGC 2979 
Phe Lys Arg Arg Arg Ser Ser Lys Asp Thr Ser Thr Gly Gly Gly Arg 
895 900 905 

GGT GCT TTG CAA AAC CAA AAA GAC ACA GTG CTG CCG CGA CGC GCT ATT 3027 
Gly Ala Leu Gin Asn Gin Lys Asp Thr Val Leu Pro Arg Arg Ala lie 
910 915 920 

GAG AGG GAG GAG GGC CAG CTG CAG CTG TGT TCC CGT CAT AGG GAG AGC 307 5 

Glu Arg Glu Glu Gly Gin Leu Gin Leu Cys Ser Arg His Arg Glu Ser 
925 930 935 

TGAGACTCCC CGCCCGCCCT CCTCTGCCCC CTCCCCCGCA GAC AG AC AG A CAGACGGACG 3135 

GGACAGCGGC CCGGCCCACG CAGAGCCCCG GAG C ACC ACG GGGTCGGGGG AGGAGCACCC 3195 

CCAGCCTCCC CCAGGCTGCG CCTGCCCGCC CGCCGGTTGG CCGGCTGGCC GGTCCACCCC 3255 

GTCCCGGCCC CGCGCGTGCC CCCAGCGTGG GGCTAACGGG CGCCTTGTCT GTGTATTTCT 3315 

ATTTTGCAGC AGTACCATCC CACTGATATC ACGGGCCCGC TCAACCTCTC AGATCCCTCG 3375 

GTCAGCACCG TGGTGTGAGG CCCCCGGAGG CGCCCACCTG CCCAGTTAGC CCGGCCAAGG 3435 

ACACTGATGG GTCCTGCTGC TCGGG AAGGC CTGAGGGAAG CCCACCCGCC CCAGAGACTG 3495 

CCCACCCTGG GCCTCCCGTC CGTCCGCCCG CCCACCCCGC TGCCTGGCGG GCAGCCCCTG 3555 

CTGGACCAAG GTGCGGACCG GAGCGGCTGA GGACGGGGCA GAGCTGAGTC GGCTGGGCAG 3615 

GGCCGCAGGG CGCTCCGGCA GAGGCAGGCC CCTGGGGTCT CTGAGCAGTG GGGhGCGGGG 3 67 5 

GCTAACTGCC CCC AGG CGG A GGGGCTTGGA GCAGAGACGG CAGCCCCATC CTTCCCGCAG 3735 

CACCAGCCTG AGCCACAGTG GGGCCCKTGG CCCCAGCTGG CTGGGTCGCC CCTCCTCGGG 3795 
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CGCCTGCGCT CCTCTGCAGC CTGAGCTCCA CCCTCCCCTC TTCTTGCGGC ACCGCCCACC 3855 

AAACACCCCG TCTGCCCCTT GACGCCACAC GCCGGGGCTG GCGCTGCCCT CCCCCACGGC 3915 

CGTCCCTGAC TTCCCAGCTG GCAGCGCCTC CCGCCGCCTC GGGCCGCCTC CTCCAGAATC 3975 

GAGAGGGCTG AGCCCCTCCT CTCCTCGTCC GGCCTGCAGC ACAGAAGGGG GCCTCCCCGG 4035 

GGGTCCCCGG ACGCTGGCTC GGGACTGTCT TCAACCCTGC CCTGCACCTT GGGCACGGGA 4095 

GAGCGCCACC CGCCCGCCCC CGCCCTCGCT CCGGGTGCGT GACCGGCCCG CCACCTTGTA 4155 

CAGAACCAGC ACTCCCAGGG CCCGAGCGCG TGCCTTCCCC GTGCGCAGCC GCGCTCTGCC 4215 

CCTCCGTCCC CAGGGTGCAG GCGCGCACCG CCCAACCCCC ACCTCCCGGT GTATGCAGTG 42 75 

GTGATGCCTA AAGGAATGTC ACG 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 938 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ser Thr Met Arg Leu Leu Thr Leu Ala Leu Leu Phe Ser Cys Ser 
1 5 10 is 

Val Ala Arg Ala Ala Cys Asp Pro Lys He Val Asn He Gly Ala Val 
20 25 30 

Leu Ser Thr Arg Lys His Glu Gin Met Phe Arg Glu Ala Val Asn Gin 

40 45 

Ala Asn Lys Arg flis Gly Ser Trp Lys He Gin Leu Asn Ala Thr Ser 
DU 55 6 o 

Val Thr His Lys Pro Asn Ala He Gin Met Ala Leu Ser Val Cys Glu 
65 70 75 * ao 

Asp Leu He Ser Ser Gin Val Tyr Ala He Leu Val Ser His Pro Pro 
85 90 95 

Thr Pro Asn Asp His Phe Thr Pro Thr Pro Val Ser Tyr Thr Ala Gly 
100 105 110 r 

Phe Tyr Arg He Pro Val Leu Gly Leu Thr Thr Arg Met Ser He Tyr 
ii5 120 125 

Ser Asp Lys Ser He His Leu Ser Phe Leu Arg Thr Val Pro Pro Tyr 
I- 30 135 140 

Ser His Gin Ser Ser Val Trp Phe Glu Met Met Arg Val Tyr Ser Trp 
145 150 155 " 16 Q 

Asn His He He Leu Leu Val Ser Asp Asp His Glu Gly Arg Ala Ala 
155 f 170 175 

Gin Lys Arg Leu Glu Thr Leu Leu Glu Glu Arg Glu Ser Lys Ala Glu 
180 185 190 
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Lys Val Leu Gin Phe Asp Pro Gly Thr Lys Asn Val Thr Ala Leu Leu 
195 200 205 

Met Glu Ala Lys Glu Leu Glu Ala Arg Val He He Leu Ser Ala Ser 
210 215 220 

Glu Asp Asp Ala Ala Thr Val Tyr Arg Ala Ala Ala Met Leu Asn Met 
225 230 235 240 

GlyThr Gly Ser Gly Tyr Val Trp Leu Val Gly Glu Arg Glu He Ser 

245 250 255 

Asn Ala Leu Arg Tyr Ala Pro Asp Gly He Leu Gly Leu Gin Leu He 
260 265 "* 270 

Asn Gly Lys Asn Glu Ser Ala His He Ser Asp Ala Val Gly Val Val 
275 280 285 

Ala Gin Ala Val His Glu Leu Leu Glu Lys Glu Asn He Thr Asp Pro 
290 295 300 

Pro Arg Gly Cys Vat Gly Asn Thr Asn He Trp Lys Thr Gly Pro Leu 
305 310 315 320 

Phe Lys Arg Val Leu Met Ser Ser Lys Tyr Ala Asp Gly Val Thr Gly 
325 330 ~ 335 

Arg Val Glu Phe Asn Glu Asp Gly Asp Arg Lys Phe Ala Asn Tyr Ser 
340 345 350 

He Met Asn Leu Gin Asn Arg Lys Leu Val Gin Val Gly He Tyr Asn 
355 360 365 

Gly Thr His Val He Pro Asn Asp Arg Lys He He Trp Pro Gly Gly 
370 375 " 380 

Glu Thr Glu Lys Pro Arg Gly Tyr Gin Met Ser Thr Arg Leu Lys He 
385 390 395 400 

Val Thr He His Gin Glu Pro Phe Val Tyr Val Lys Pro Thr Leu Ser 
405 410 415 

Asp Gly Thr Cys Lys Glu Glu Phe Thr Val Asn Gly Asp Pro Val Lys 
420 425 ~ 430 

Lys Val He Cys Thr Gly Pro Asn Asp Thr Ser Pro Gly Ser Pro Arg 
435 440 445 

His Thr Val Pro Gin Cys Cys Tyr Gly Phe Cys He Asp Leu Leu He 
450 455 460 

Lys Leu Ala Arg Thr Met Asn Phe Thr Tyr Glu Val His Leu Val Ala 
465 470 475 480 

Asp Gly Lys Phe Gly Thr Gin Glu Arg Val Asn Asn Ser Asn Lys Lys 
485 490 495 

Glu Trp Asn Gly Met Met Gly Glu Leu Leu Ser Gly Gin Ala Asp Met 
500 505 510 

He Val Ala Pro Leu Thr He Asn Asn Glu Arg Ala Gin Tyr He Glu 
515 52Q 525 

Phe Ser Lys Pro Phe Lys Tyr Gin Gly Leu Thr He Leu Val Lys Lys 
530 535 540 
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Glu lie Pro Arg Ser Thr Leu Asp Ser Phe Met Gin Pro Phe Gin Ser 
5 550 555 560 

Thr Leu Trp Leu Leu Val Gly Leu Ser Val His Val Val Ala Va 1 Met 
565 570 575 

Leu Tyr Leu Leu Asp Arg Phe Ser Pro Phe Gly Arg Phe Lys Val Asn 
580 585 5 f 0 

Ser Glu Glu Glu Glu Glu Asp Ala Leu Thr Leu Ser Ser Ala Met Trp 
595 600 605 

Phe ser Trp Gly Val Leu Leu Asn Ser Gly He Gly Glu Gly Ala Pro 

615 6 2 o 

Arg Ser Phe Ser Ala Arg He Leu Gly Met Val Trp Ala Gly Phe Ala 
^ b 630 635 640 

Met He He Val Ala Ser Tyr Thr Ala Asn Leu Ala Ala Phe Leu Val 
645 650 655 

Leu Asp Arg Pro GW Glu Arg He Thr Gly He Asn Asp Pro Arg Leu 
660 665 670 

Arg Asn Pro Ser Asp Lys Phe He Tyr Ala Thr Val Lys Gin Ser Ser 
675 680 685 

Val Asp He Tyr Phe Arg Arg Gin Val Glu Leu Ser Thr Met Tyr Arg 

695 700 

His Met Glu Lys His Asn Tyr Glu Ser Ala Ala Glu Ala He Gin Ala 

710 715 7 2 o 

Val Arg Asp Asn Lys Leu His Ala Phe He Trp Asp Ser Ala Val Leu 
725 730 735 

Glu Phe Glu Ala Ser Gin Lys Cys Asp Leu Val Thr Thr Gly Glu Leu 
40 745 7 5 o 

Phe Phe Arg Ser Gly Phe Gly He Gly Met Arg Lys Asp Ser Pro Trp 
733 760 765 

Lys Gin Asn Val Ser Leu Ser He Leu Lys Ser His Glu Asn Gly Phe 

775 780 

785 ° 1U ASP LSU ASP Thr Tr P Val A ^g Tyr Gin Glu Cys Asp Ser 

790 7 95 ~ 800 

Arg Ser Asn Ala Pro Ala Thr Leu Thr Phe Glu Asn Met Ala Gly Val 
805 810 8 15 

Phe Met Leu Val Ala Gly Gly He Val Ala Gly He Phe Leu He Phe 
820 825 Q30 

He Glu He Ala Tyr Lys Arg His Lys Asp Ala Arg Arg Lys Gin Met 

840 345 

Gin Leu Ala Phe Ala Ala Val Asn Val Trp Arg Lys Asn Leu Gin Asp 

855 860 

all GlY Arg G1U Pr ° As P Pr ° ^s Lys Lys Ala Thr Phe 

870 ' 875 ago 

Arg Ala He Thr Ser Thr Leu Ala Ser Ser Phe Lys Arg Arg Arg Ser 
885 890 895 
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Ser Lys Asp Thr Ser Thr Gly Gly Gly Arg Gly Ala Leu Gin Asn Gin 
900 905 910 

Lys Asp Thr Val Leu Pro Arg Arg Ala He Glu Arg Glu Glu Glv Gin 
915 920 925 

Leu Gin Leu Cys Ser Arg His Arg Glu Ser 
930 935 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 63 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

AGT AAA AAA AGG AAC TAT GAA AAC CTC GAC CAA CTG TCC TAT GAC AAC 48 
Ser Lys Lys Arg Asn Tyr Glu Asn Leu Asp Gin Leu Ser Tyr Asp Asn 

AAG CGC GGA CCC AAG 

Lys Arg Gly Pro Lys 63 
20 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Ser Lys Lys Arg Asn Tyr Glu Asn Leu Asp Gin Leu Ser Tyr Asp Asn 
1 5 10 ls 

Lys Arg Gly Pro Lys 
20 

(2) INFORMATION FOR SEQ ID NO: 5: - * 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 



(ii) MOLECULE TYPE: cDNA 
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(ix) FEATURE: 

(A) N AWE /KEY : CDS 

(B) LOCATION: 189.. 3899 



60 
120 
180 
230 



422 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

CCCTTAATAA GATTTGCCAC GTACACTCGA GCCATCGCGA GTGTCCTTG A GCCGCGGGTG 

ACGGTGGCTC TCGCTGCTCG CGCCCCCTCC TCCCGCGGGG GG AGCCTG AT GCCACGTTCC 

CTATGAATTA TTTATCGCCG GCCTAAAAAT ACCCCGAACT TCACAGCCCG AGTGACCCTC 

CGGTGGAC ATG GGT GGG GCC CTG GGG CCG GCC CTG TTG CTC ACC TCG CTC 
Met Gly Gly Ala Leu Gly Pro Ala Leu Leu Leu Thr Ser Leu 
1 5 10 

TTC GGT GCC TGG GCA GGG CTG GGT CCG GGG CAG GGC GAG CAG GGC ATG 2 78 

Phe Gly Ala Trp Ala Gly Leu Gly Pro Gly Gin Gly Glu Gin Gly Met 
15 20 25 30 

ACG GTG GCC GTG GTG TTT AGC AGC TCA GGG CCG CCC CAG GCC CAG TTC 32 6 

Thr Val Ala Val Val Phe Ser Ser Ser Gly Pro Pro Gin Ala Gin Phe 
35 40 45 

CGT GTC CGC CTC ACC CCC CAG AGC TTC CTG GAC CTA CCC CTG GAG ATC 374 
Arg Val Arg Leu Thr Pro Gin Ser Phe Leu Asp Leu Pro Leu Glu He 
50 55 L 60 

CAG CCG CTC ACA GTT GGG GTC AAC ACC ACC AAC CCC AGC AGC CTC CTC 
Gin Pro Leu Thr Val Gly Val Asn Thr Thr Asn Pro Ser Ser Leu Leu 
65 70 75 

ACC CAG ATC TGC GGC CTC CTG GGT GCT GCC CAC GTC CAC GGC ATT GTC 470 
Thr Gin He Cys Gly Leu Leu Gly Ala Ala His Val His Gly He Val 
80 85 90 

TTT GAG GAC AAC GTG GAC ACC GAG GCG GTG GCC CAG ATC CTT GAC TTC 
Phe Glu Asp Asn Val Asp Thr Glu Ala Val Ala Gin He Leu Asp Phe 
95 100 105 110 

ATC TCC TCC CAG ACC CAT GTG CCC ATC CTC AGC ATC AGC GGA GGC TCT 566 
He Ser Ser Gin Thr His Val Pro He Leu Ser He Ser Gly Gly Ser 
115 120 125 

GCT GTG GTC CTC ACC CCC AAG GAG CCG GGC TCC GCC TTC CTG CAG CTG 614 
Ala Val Val Leu Thr Pro Lys Glu Pro Gly Ser Ala Phe Leu Gin Leu 
130 135 140 

GGC GTG TCC CTG GAG CAG CAG CTG CAG GTG CTG TTC AAG GTG CTG GAA 
Gly Val Ser Leu Glu Gin Gin Leu Gin Val Leu Phe Lys Val Leu Glu 
145 150 155 

GAG TAC GAC TGG AGC GCC TTC GCC GTC ATC ACC AGC CTG CAC CCG GGC 710 
Glu Tyr Asp Trp Ser Ala Phe Ala Val He Thr Ser Leu His Pro Glv 
160 165 170 

CAC GCG CTC TTC CTG GAG GGC GTG CGC GCC GTC GCC GAC GCC AGC CAC 758 
His Ala Leu Phe Leu Glu Gly Val Arg Ala Val Ala Asp Ala Ser His 
175 180 185 190 

GTG AGT TGG CGG CTG CTG GAC GTG, GTC ACG CTG GAA CTG GAC CCG GGA 
Val Ser Trp Arg Leu Leu Asp Val Val Thr Leu Glu Leu Asp Pro Glv 
195 200 205 



518 



662 



806 
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GGG CCG CGC GCG CGC ACG CAG CGC CTG CTG CGC CAG CTC G AC GCG CCC 

Glv Pro Arg Ala Arg Thr Gin Arg Leu Leu Arg Gin Leu Asp Ala Pro 
Y 210 215 220 

GTG TTT GTG GCC TAC TGC TCG CGC GAG GAG GCC GAG GTG CTC TTC GCC 
Val Phe Val Ala Tyr Cys Ser Arg Glu Glu Ala Glu Val Leu Phe Ala 
225 ' 230 235 

GAG GCG GCG CAG GCC GGT CTG GTG GGG CCC GGC CAC GTG TGG CTG GTG 
Glu Ala Ala Gin Ala Gly Leu Val Gly Pro Gly His Val Trp Leu Val 
240 245 250 

CCC AAC CTG GCG CTG GGC AGC ACC GAT GCG CCC CCC GCC ACC TTC CCC 
Pro Asn Leu Ala Leu Gly Ser Thr Asp Ala Pro Pro Ala Thr Phe Pro 
255 260 265 270 

GTG GGC CTC ATC AGC GTC GTC ACC GAG AGC TGG CGC CTC AGC CTG CGC 
Val Glv Leu He Ser Val Val Thr Glu Ser Trp Arg Leu Ser Leu Arg 
275 280 285 

CAG AAG GTG CGC GAC GGC GTG GCC ATT CTG GCC CTG GGC GCC CAC AGC 
Gin Lvs Val Arg Asp Gly Val Ala He Leu Ala Leu Gly Ala His Se-r 
290 ' 295 300 



854 



902 



950 



998 



1046 



1094 



TAC TGG CGC CAG CAT GGA ACC CTG CCA GCC CCG GCC GGG GAC TGC CGT 1142 

Tvr Trp Arq Gin His Gly Thr Leu Pro Ala Pro Ala Gly Asp Cys Arg 

305 ' 310 315 

GTT CAC CCT GGG CCC GTC AGC CCT GCC CGG GAG GCC TTC TAC AGG CAC 1190 

Val His Pro Gly Pro Val Ser Pro Ala Arg Glu Ala Phe Tyr Arg His 
320 " 325 330 

CTA CTG AAT GTC ACC TGG GAG GGC CGA GAC TTC TCC TTC AGC CCT GGT 12 38 

Leu Leu Asn Val Thr Trp Glu Gly Arg Asp Phe Ser Phe Ser Pro Gly 

335 340 345 350 

GGG TAC CTG GTC CAG CCC ACC ATG GTG GTG ATC GCC CTC AAC CGG CAC 1286 

Gly Tyr Leu Val Gin Pro Thr Met Val Val He Ala Leu Asn Arg His 

355 360 365 

CGC CTC TGG GAG ATG GTG GGG CGC TGG GAG CAT GGC GTC CTA TAC ATG 1334 

Arq Leu Trp Glu Met Val Gly Arg Trp Glu His Gly Val Leu Tyr Met 

370 375 380 

AAG TAC CCC GTG TGG CCT CGC TAC AGT GCC TCT CTG CAG CCT GTG GTG 1382 

Lvs Tyr Pro Val Trp Pro Arg Tyr Ser Ala Ser Leu Gin Pro Val Val 

385 ~ 390 395 

GAC AGT CGG CAC CTG ACG GTG GCC ACG CTG GAA GAG CGG CCC TTT GTC 1430 

Asp Ser Arg His Leu Thr Val Ala Thr Leu Glu Glu Arg Pro Phe Val 
400 405 410 

ATC GTG GAG AGC CCT GAC CCT GGC ACA GGA GGC TGT GTC CCC AAC ACC 1478 

He Val Glu Ser Pro Asp Pro Gly Thr Gly Gly. Cys Val Pro Asn Thr 

415 420 425 430 

GTG CCC TGC CGC AGG CAG AGC AAC CAC ACC TTC AGC AGC GGG GAC GTG 1526 

Val Pro Cys Arg Arq Gin Ser Asn His Thr Phe Ser Ser Gly Asp Val 

435 440 445 

GCC CCC TAC ACC AAG CTC TGC TGT AAG GGA TTC TGC ATC GAC ATC CTC 157 4 

Ala Pro Tyr Thr Lys Leu Cys Cys' Lys Gly Phe Cys He Asp He Leu 

450 455 460 

AAG AAG CTG GCC AGA GTG GTC AAA TTC TCC TAC GAC CTG TAC CTG GTG 1622 

Lys Lys Leu Ala Arg Val Val Lys Phe Ser Tyr Asp Leu Tyr Leu Val 

465 470 475 
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ACC AAC GGC AAG CAT GGC AAG CGG GTG CGC GGC GTA TGG AAC GGC ATG 1670 

Thr Asn Gly Lys His Gly Lys Arg Val Arg Gly Val Trp Asn Gly Met 

480 485 490 

ATT GGG GAG GTG TAC TAC AAG CGG GCA GAC ATG GCC ATC GGC TCC CTC 1718 
He Gly Glu Val Tyr Tyr Lys Arg Ala Asp Met Ala He Gly Ser Leu 
495 500 505 510 

ACC ATC AAT GAG GAA CGC TCC GAG ATC GTA GAC TTC TCT GTA CCC TTT 1766 
Thr He Asn Glu Glu Arg Ser Glu He Val Asp Phe Ser Val Pro Phe 
515 520 525 

GTG GAG ACG GGC ATC AGT GTG ATG GTG GCT CGC AGC AAT GGC ACC GTC 1814 
Val Glu Thr Gly He Ser Val Met Val Ala Arg Ser Asn Gly Thr Val 
530 535 540 

TCC CCC TCG GCC TTC TTG GAG CCA TAT AGC CCT GCA GTG TGG GTG ATG 1862 
Ser Pro Ser Ala Phe Leu Glu Pro Tyr Ser Pro Ala Val Trp Val Met 
545 550 555 

ATG TTT GTC ATG TGC CTC ACT GTG GTG GCC ATC ACC GTC TTC ATG TTC 1910 
Met Phe Val Met Cys Leu Thr Val Val Ala He Thr Val Phe Met Phe 
560 565 570 

GAG TAC TTC AGC CCT GTC AGC TAC AAC CAG AAC CTC ACC AGA GGC AAG 19 58 

Glu Tvr Phe Ser Pro Val Ser Tyr Asn Gin Asn Leu Thr Arg Gly Lys 

-* _ _ r n r COO 



575 



580 ~ 585 590 



AAG TCC GGG GGC CCA GCT TTC ACT ATC GGC AAG TCC GTG TGG CTG CTG 
Lys Ser Gly Gly Pro Ala Phe Thr He Gly Lys Ser Val Trp Leu Leu 
595 600 605 
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TGG GCG CTG GTC TTC AAC AAC TCA GTG CCC ATC GAG AAC CCG CGG GGC 2054 

Trp Ala Leu Val Phe Asn Asn Ser Val Pro He Glu Asn Pro Arg Gly 

610 615 620 

ACC ACC AGC AAG ATC ATG GTT CTG GTC TGG GCC TTC TTT GCT GTC ATC 2102 

Thr Thr Ser Lys He Met Val Leu Val Trp Ala Phe Phe Ala Val He 

625 630 635 

TTC CTC GCC AGA TAC ACG GCC AAC CTG GCC GCC TTC ATG ATC CAA GAG 2150 

Phe Leu Ala Arg Tyr Thr Ala Asn Leu Ala Ala Phe Met He Gin Glu 

640 645 650 

CAA TAC ATC GAC ACT GTG TCG GGC CTC AGT GAC AAG AAG TTT CAG CGG 2198 

Gin Tyr He Asp Thr Val Ser Gly Leu Ser Asp Lys Lys Phe Gin Arg 

655 *" 660 665 670 

CCT CAA GAT CAG TAC CCA CCT TTC CGC TTC GGC ACG GTG CCC AAC GGC 2246 

Pro Gin Asp Gin Tyr Pro Pro Phe Arg Phe Gly Thr Val Pro Asn Gly 

675 680 685 

AGC ACG GAG CGG AAC ATC CGC AGT AAC TAC CGT GAC ATG CAC ACC CAC 2294 

Ser Thr Glu Arg Asn He Arg Ser Asn Tyr Arg Asp Met His Thr His 

690 695 700 

ATG GTC AAG TTC AAC CAG CGC TCG GTG GAG GAC GCG CTC ACC AGC CTC 2342 

Met Val Lys Phe Asn Gin Arg Ser Val Glu Asp Ala Leu Thr Ser Leu 

705 710 715 

AAG ATG GGG AAG CTG GAT GCC TTC ATC TAT GAT GCT GCT GTC CTC AAC 2 390 

Lys Met Gly Lys Leu Asp Ala Phe'lle Tyr Asp Ala Ala Val Leu Asn 

720 " 725 730 

TAC ATG GCA GGC AAG GAC GAG GGC TGC AAG CTG GTC ACC ATT GGG TCT 2438 

Tyr Met Ala Gly Lys Asp Glu Gly Cys Lys Leu Val Thr He Gly Ser 

735 740 745 750 
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GGC AAG GTC TTT GCT ACC ACT GGC TAC GGC ATC GCC ATG C AG AAG GAC 2486 

Gly Lys Val Phe Ala Thr Thr GLy Tyr Gly lie Ala Met Gin Lys Asp 
755 760 765 

TCC CAC TGG AAG CGG GCC ATA GAC CTG GCG CTC TTG CAG TTC CTG GGG 2534 
Ser His Trp Lys Arg Ala He Asp Leu Ala Leu Leu Gin Phe Leu Gly 
770 ' 775 780 

GAC GGA GAG ACA CAG AAA CTG GAG ACA GTG TGG CTC TCA GGG ATC TGC 2 582 

Asp Gly Glu Thr Gin Lys Leu Glu Thr Val Trp Leu Ser Gly He Cys 
785 ' 790 795 

CAG AAT GAG AAG AAC GAG GTG ATG AGC AGC AAG CTG GAC ATC GAC AAC 2 630 

Gin Asn Glu Lys Asn Glu Val Met Ser Ser Lys Leu Asp He Asp Asn 
800 ~ 805 810 

ATG GCA GGC GTC TTC TAC ATG CTG CTG GTG GCC ATG GGG CTG GCC CTG 2 678 

Met Ala Gly Val Phe Tyr Met Leu Leu Val Ala Met Gly Leu Ala Leu 
815 820 825 830 

CTG GTC TTC GCC TGG GAG CAC CTG GTC TAC TGG AAG CTG CGC CAC TCG 2726 
Leu Val Phe Ala Trp Glu His Leu Val Tyr Trp Lys Leu Arg His Ser 
835 840 845 

GTG CCC AAC TCA TCC CAG CTG GAC TTC CTG CTG GCT TTC AGC AGG GGC 2774 
Val Pro Asn Ser Ser Gin Leu Asp Phe Leu Leu Ala Phe Ser Arg Gly 
850 855 860 

ATC TAC AGC TGC TTC AGC GGG GTG CAG AGC CTC GCC AGC CCA CCG CGG 2822 
He Tyr Ser Cys Phe Ser Gly Val Gin Ser Leu Ala Ser Pro Pro Arg 
865 870 875 



CAG GCC AGC CCG GAC CTC ACG GCC AGC TCG GCC CAG GCC AGC GTG CTC 
Gin Ala Ser Pro Asp Leu Thr Ala Ser Ser Ala Gin Ala Ser Val Leu 
880 ~ 885 890 
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AAG ATG CTG CAG GCA GCC CGC GAC ATG GTG ACC ACG GCG GGC GTA AGC 2918 

Lys Met Leu Gin Ala Ala Arg Asp Met Val Thr Thr Ala Gly Val Ser 

895 900 905 910 

AGC TCC CTG GAC CGC GCC ACT CGC ACC ATC GAG AAT TGG GGT GGC GGC 2966 

Ser Ser Leu Asp Arg Ala Thr Arg Thr He Glu Asn Trp Gly Gly Gly 

915 920 925 

CGC CGT GCG CCC CCA CCG TCC CCC TGC CCG ACC CCG CGG TCT GGC CCC 3014 

Arg Arg Ala Pro Pro Pro Ser Pro Cys Pro Thr Pro Arg Ser Gly Pro 
930 935 940 

AGC CCA TGC CTG CCC ACC CCC GAC CCG CCC CCA GAG CCG AGC CCC ACG 3062 

Ser Pro Cys Leu Pro Thr Pro Asp Pro Pro Pro Glu Pro Ser Pro Thr 

945 950 955 

GGC TGG GGA CCG CCA GAC GGG GGT CGC GCG GCG CTT GTG CGC AGG GCT 3110 

Gly Trp Gly Pro Pro Asp Gly Gly Arg Ala AlaLeu Val Arg Arg Ala 
960 ' 965 970 

CCG CAG CCC CCG GGC CGC CCC CCG ACG CCG GGG CCG CCC CTG TCC GAC 3158 

Pro Gin Pro Pro Gly Arg Pro Pro Thr Pro Gly Pro Pro Leu Ser Asp 

975 980 985 990 

GTC TCC CGA GTG TCG CGC CGC CCA, GCC TGG GAG GCG CGG TGG CCG GTG 3206 

Val Ser Arg Val Ser Arg Arg Pro'Ala Trp Glu Ala Arg Trp Pro Val 

995 ~ 1000 1005 

CGG ACC GGG CAC TGC GGG AGG CAC CTC TCG GCC TCC GAG CGG CCC CTG 3254 

Arg Thr Gly His Cys Gly Arg His Leu Ser Ala Ser Glu Arg Pro Leu 
1010 * 1015 1020 



• 
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His Val Cys Leu His Ala His Ma His Leu Pro Phe Cys Trp Gly Ala 
1155 1160 H65 



TCG CCC GCG CGC TGT CAC TAC AGC TCC TTT CCT CGA GCC GAC CGA TCC 3302 

Ser Pro Ala Arg Cys His Tyr Ser Ser Phe Pro Arg Ala Asp Arg Ser 
102 5 1030 1035 

CCC CGC CCC TTC CTC CCC CTC TTC CCC GAG CCC CCG GAG CTG GAG GAC 3350 
Gly Arg Pro Phe Leu Pro Leu Phe Pro Glu Pro Pro Glu Leu Glu Asp 
1040 1045 1050 

CTG CCG CTG CTC GGT CCG GAG CAG CTG GCC CGG CGG GAG GCC CTG CTG 3398 
Leu Pro Leu Leu Gly Pro Glu Gin Leu Ala Arg Arg Glu Ala Leu Leu 
1055 1060 1065 1070 

CAC GCG GCC TGG GCC CGG GGC TCG CGC CCG CGT CAC CCT TCC CTG CCC 344 6 

His Ala Ala Trp Ala Arg Gly Ser Arg Pro Arg His Ala Ser Leu Pro 
1075 1080 1085 

AGC TCC GTG GCC GAG GCC TTC GCT CGG CCC AGC TCG CTG CCC CCT GGG 3494 
Ser Ser Val Ala Glu Ala Phe Ala Arg Pro Ser Ser Leu Pro Ala Gly 
1090 1095 1100 

TGC ACC GGC CCC GCC TGC GCC CGC CCC GAC GGA CAC TCG GCC TGC AGG 3542 
Cys Thr Gly Pro AW Cys Ala Arg Pro Asp Gly His Ser Ala Cys Arg 
1105 1110 ll 15 

CGC TTG GCG CAG GCG CAG TCG ATG TGC TTG CCG ATC TAC CGG GAG GCC 3590 
Arg Leu Ala Gin Ala Gin Ser Met Cys Leu Pro He Tyr Arg Glu Ala 
1120 ll 25 ll 30 

TGC CAG GAG GGC GAG CAG GCA GGG GCC CCC GCC TGG CAG CAC AGA CAG 3638 
Cys Gin Glu Gly Glu Gin Ala Gly Ala Pro Ala Trp Gin His Arg Gin 
1135 H40 H45 H50 

CAC GTC TGC CTG CAC GCC CAC GCC CAC CTG CCA TTT TGC TGG GGG GCT 3686 



3734 



GTC TGT CCT CAC CTT CCA CCC TGT GCC AGC CAC GGC TCC TGG CTC TCC 
Val Cys Pro His Leu Pro Pro Cys Ala Ser His Gly Ser Trp Leu Ser 
H70 H75 H80 

GGG GCC TGG GGG CCT CTG GGG CAC AGG GGC AGG ACT CTG GGG CTG GGC 3782 
Glv Ala Trp Gly Pro Leu Gly His Arg Gly Arg Thr Leu Gly Leu Gly 
Y 1185 H90 H95 

ACA GGC TAC AGA GAC AGT GGG GGA CTG GAC GAG ATC AGC AGG GTA GCC 3830 
Thr Gly Tyr Arg Asp Ser Gly Gly Leu Asp Glu He Ser Arg Val Ala 
1200 1205 1210 

CGT GGG ACG CAA GGC TTC CCG GGA CCC TGC ACC TGG AGA CGG ATC TCC 3878 
Arq Gly Thr Gin Gly Phe Pro Gly Pro Cys Thr Trp Arg Arg He Ser 
1215 1220 1225 1230 

AGT CTG GAG TCA GAA GTG TGAGTTATCA GCCACTCAGG CTCCGAGCCA 3926 
Ser Leu Glu Ser Glu Val 
1235 

GCTGGATTCT CTGCCTGCCA CTGTCAGGGT TAAGCGGCAG GCAGGATTGG GCTTTTCTGG 3986 

CTTCTACCAT GAAATCCTGG CCATGGGACC CCAGTGACAG ATGATGTCTT CCATGGTCAT 404 6 

CAGTGACCTC AGTAGCCTCA AATCATGGTG AGGGCTGGGC TTTTGCTGTC CTCTTCTCAC 4106 

GCAGAGTTCT GCCAGGAGGG TGTGCTGTGG GGGTCAGACT CCTGAGGCTC TCCCTTCCCT 4166 

GGGGCTAGCC AGTTACTGGT CATGCCTGCT GTGGGCATGG AGGCTGGAAC TTGTGGTTGA 42 2 6 
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GGCAGCGCCA TCCCCATCCT TGCTCTACCT GGCTAGAGTT TCTTCTCATC AGAGCACTGG 4286 
G AC ATT AAA C CCACCTTTTC CCAG AAAAAA A AAA AAAAA A A AAA A A AAA A AAAG 43 4 0 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1236 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Gly Gly Ala Leu Gly Pro Ala Leu Leu Leu Thr Ser Leu Phe Gly 
1 5 1° 15 

Ala Trp Ala Gly Leu Gly Pro Gly Gin Gly Glu Gin Gly Met Thr Val 

25 30 



20 



Ala Val Val Phe Ser Ser Ser Gly Pro Pro Gin Ala Gin Phe Arg Val 
35 40 45 

Arq Leu Thr Pro Gin Ser Phe Leu Asp Leu Pro Leu Glu lie Gin Pro 
50 55 60 

Leu Thr Val Gly Val Asn Thr Thr Asn Pro Ser Ser Leu Leu Thr Gin 
65 70 75 80 

lie Cvs Gly Leu Leu Gly Ala Ala His Val His Gly He Val Phe Glu 
85 90 9 5 

Asp Asn Val Asp Thr Glu Ala Val Ala Gin He Leu Asp Phe He Ser 
100 105 HO 

Ser Gin Thr His Val Pro He Leu Ser He Ser Gly Gly Ser Ala Val 
115 120 125 

Val Leu Thr Pro Lys Glu Pro Gly Ser Ala Phe Leu Gin Leu Gly Val 
130 ' 135 140 

Ser Leu Glu Gin Gin Leu Gin Val Leu Phe Lys Val Leu Glu Glu Tyr 
145 150 155 160 

Asp Trp Ser Ala Phe Ala Val He Thr Ser Leu His Pro Gly His Ala 
165 170 175 

Leu Phe Leu Glu Gly Val Arg Ala Val Ala Asp Ala Ser His Val Ser 
180 ' 185 190 

Trp Arq Leu Leu Asp Val Val Thr Leu Glu Leu Asp Pro Gly Gly Pro 
195 200 205 

Arg Ala Arg Thr Gin Arg Leu Leu Arg Gin Leu Asp Ala Pro Val Phe 
210 215 220 

Val Ala Tyr Cys Ser Arg Glu Glu Ala Glu Val Leu Phe Ala Glu Ala 
225 230 235 240 

Ala Gin Ala Gly Leu Val Gly Pro' Gly His Val Trp Leu Val Pro Asn 
" * 2 50 255 



245 



Leu Ala Leu Gly Ser Thr Asp Ala Pro Pro Ala Thr Phe Pro Val Gly 
260 265 270 
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Leu He Ser Val Val Thr Glu Ser Trp Arg Leu Ser Leu Arg Gin Lys 
275 280 285 

Val Arq Asp Gly Val Ala He Leu Ala Leu Gly Ala His Ser Tyr Trp 
290 295 300 

Arq Gin His Gly Thr Leu Pro Ala Pro Ala Gly Asp Cys Arg Val His 
305 310 315 320 

Pro Gly Pro Val Ser Pro Ala Arg Glu Ala Phe Tyr Arg His Leu Leu 
325 330 335 

Asn Val Thr Trp Glu Gly Arg Asp Phe Ser Phe Ser Pro Gly Gly Tyr 
340 345 350 

Leu Val Gin Pro Thr Met Val Val He Ala Leu Asn Arg His Arg Leu 
355 360 365 

Trp Glu Met Val Gly Arg Trp Glu His Gly Val Leu Tyr Met Lys Tyr 
370 375 380 

Pro Val Trp Pro Arg-' Tyr Ser Ala Ser Leu Gin Pro Val Val Asp Ser 
385 390 395 400 

Arq His Leu Thr Val Ala Thr Leu Glu Glu Arg Pro Phe Val He Val 
405 410 415 

Glu Ser Pro Asp Pro Gly Thr Gly Gly Cys Val Pro Asn Thr Val Pro 
420 ' 425 430 

Cys Arq Arq Gin Ser Asn His Thr Phe Ser Ser Gly Asp Val Ala Pro 
435 440 445 

Tyr Thr Lys Leu Cys Cys Lys Gly Phe Cys He Asp He Leu Lys Lys 
450 455 460 

Leu Ala Arg Val Val Lys Phe Ser Tyr Asp Leu Tyr Leu Val Thr Asn 
465 ~ 470 475 480 

Gly Lys His Gly Lys Arg Val Arg Gly Val Trp Asn Gly Met He Gly 
485 490 495 

Glu Val Tyr Tyr Lys Arg Ala Asp Met Ala He Gly Ser Leu Thr He 
500 505 510 

Asn Glu Glu Arg Ser Glu He Val Asp Phe Ser Val Pro Phe Val Glu 
515 520 525 

Thr Gly He Ser Val Met Val Ala Arg Ser Asn Gly Thr Val Ser Pro 
530 535 540 

Ser Ala Phe Leu Glu Pro Tyr Ser Pro Ala Val Trp Val Met Met Phe 
545 550 555 560 

Val Met Cys Leu Thr Val Val Ala He Thr Val Phe Met Phe Glu Tyr 
565 570 575 

Phe Ser Pro Val Ser Tyr Asn Gin Asn Leu Thr Arg Gly Lys Lys Ser 
580 ^ 585 590 

Gly Gly Pro Ala Phe Thr He Gly Lys Ser Val Trp Leu Leu Trp Ala 
595 600' 605 

Leu Val Phe Asn Asn Ser Val Pro He Glu Asn Pro Arg Gly Thr Thr 
610 615 620 
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JJJ L,. He „« v .i l. u v.l Trp A1 . Phe „ „. Val „. ^ ^ 

640 

«a ^ ryr Thr „ Aan Leu „, M . ^ ^ ^ ^ 

650 655 

.1. ASP T„r v.l Ser OIy Leu ,„ Lys p ^ o 

" 670 

».P «n gj Pro Pro Ph , Rrg oly Thr vai a ^ ^ 

68 5 

Glu Arg Asn lie Arg Ser Asn a 

"0 9 J™ Ar ^ As P Me t His Thr His Met Va 1 

Lys Phe Asn Gin Ara Ser- l <~i 

705 9 7^0 ASP Ala L ?" Th - Leu Lys Met 

Gly Lys Leu Asp Ala Phe lie Tyr Asn ala al „ , 

725 7 ASP Ala Val Leu Asn Tyr Met 

»x. =i y i y . ?J p olu oly Cy . Ly „ yai iu ^ 

750 

PPe 755 ry, Cg „. Ma „« Cl „ Lys s=p „, „ u 

/bU 765 
Ly. Ar 9 „. n . ssp Ala L «„ L . u „„ ^ „ ly ^ ^ 

«j ». cm Lya L .„ „„ Thr val Trp L>u ™ n> ^ ^ 

_ 5 800 

Glu Lys Asn Glu Val Met Ser- <^ r 

805 ^ LyS As P Asp Asn Met Ala 

815 

Gly Val Phe Tyr Met Leu Leu Val Ala Met n , 

820 ai A ; a Met G1 Y Leu Ala Leu Leu Val 

830 

Phe Ala Trp Glu His Leu Val Tvr t 

835 Val Tr P Leu Arg His Ser Val Pro 

845 

-» s„ ser c ln L « u A . p ^ Leu ftu ;s ^ ^ ^ 

;.j c y . Ph . s « r Cly OIB ser Leu ri> ^ ^ ^ ^ 

- n 875 880 

Ser Pro Asp Leu Thr Ala Ser Ser Ala rin «i «. 

885 6r Ala Ala Val Leu Lys Met 

895 

Leu «n M. M. Sra A . p Met val Ma c ^ ^ s ^ 

yUb 910 
- A.p Jjj M . Thr Arg Thc „. clu fl>n ^ ^ c ^ ^ g B ^ 

PJO PPO P„ S« P„ c p„ Thc p „ firg 

J:3 940 

3- -u pp. Thr Pro ; . g Pro Pro pro ciu p ^ t ^ ^ 

01 ' 960 

=1, >~ P„ ,. p „, Gly flpg A1 , ^ ^ ^ 



107 

Pro Pro Gly Arg Pro Pro Thr Pro Gly Pro Pro Leu Ser Asp Val Ser 
980 985 990 

Arg Val Ser Arg Arg Pro Ala Trp Glu Ala Arg Trp Pro Val Arg Thr 

995 iooo 1005 

Gly Hi^Cys Gly Arg His Leu Ser Ala Ser Glu Arg Pro Leu Ser Pro 



1020 



Ala Arg Cys His Tyr Ser Ser Phe Pro Arg Ala Asp Arg Ser Gly Arg 

1030 1035 104 0 

Pro Phe Leu Pro Leu Phe Pro Glu Pro Pro Glu Leu Glu Asp Leu Pro 

1045 !°50 1055 

Leu Leu Gly Pro Glu Gin Leu Ala Arg Arg Glu Ala Leu Leu His Ala 

1060 1065 10? 0 

Ala Trp Ala^Arg Gly Ser Arg Pro^Arg His Ala Ser Leu Pro Ser Ser 

?o"90 G1U A1 \ Phe Ala ^I c Pr ° Ser Ser Leu Pro Ala Cys .Thr 

1095 1100 

Gly^Pro Ala Cys Ala Arg Pro Asp Gly His Ser Ala Cys Arg Arg Leu 

1115 1120 

Ala Gin Ala Gin Ser Met Cys Leu Pro He Tyr Arg Glu Ala Cys Gin 
1125 1130 !i3 5 

Glu Gly Glu Gin Ala Gly Ala Pro Ala Trp Gin His Arg Gin His Val 

1145 1150 

Cys Leu His Ala His Ala His Leu Pro Phe Cys Trp Gly Ala Val Cys 
1155 1160 1165 

Pro Hi^Leu Pro Pro Cys Ala Ser His Gly Ser Trp Leu Ser Gly Ala 

±1/5 1180 
TrpGly Pro Leu Gly Hi. Arg Gly Arg Thr Leu Gly Leu Gly Thr Gly 

1195 12 00 

Tyr Arg Asp Ser Gly Gly Leu Asp Glu He Ser Arg Val Ala Arg Gly 
1205 1210 12 ? 5 Y 

Thr Gin Gly Phe Pro Gly Pro Cys Thr Trp Arg Arg He Ser Ser Leu 

1220 "25 1230 

Glu Ser Glu Val 
1235 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 2 . . 22 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

C TCT GAG GCT CAG CCT GTC CCC AG 
Ser Glu Ala Gin Pro Val Pro 
1 5 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

Ser Glu Ala Gin Pro Val Pro 
1 5 

(2) INFORMATION FOR: SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
AGAAGGGGGT G 11 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4808 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

( ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 311.. 4705 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: - 

ATCATGGGAC CGGGTGAGCG CTGAGAATCG CGGCCGCAGC CATCAGCCCT GGAGATGACC 60 

AGGAGCGGCC ACTGCTGAGA ACTATGTGGA GAGAGGCTGC GAGCCCTGCT GCAGAGCCTC 120 

CGGCTGGGAT AGCCGCCCCC CGTGGGGGCG ATGCGGACAG CGCGGGACAG CCAGGGGAGC 180 

GCGCTGGGGC CGCAGCATGC GGGAACCCGC TAAACCCGGT GGCTGCTGAG GCGGCCGAGA 240 

TGCTCGTGCG CGCAGCGCGC CCCACTGCAT CCTCGACCTT CTCGGGCTAC AGGGACCGTC 300 
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AGTGGCGACT ATG GGC AG A GTG GGC TAT TGG ACC CTC CTG GTG CTG CCC 

Met Cly Arg Va 1 Gi y T yr Trp Thr Leu Leu vll Su Pro 349 

5 10 



60 



s?s sis sss sis as si? as si? sa ai sis sis 21? as as ks 

70 75 



X S 2K K? SIS - JS SS !S SJ ?K 55 21? IS S?S S2 

8 5 



90 



165 170 



245 250 



TTC TGG ATT GTC CCC AGC TTG GTC TPT rrr- 

Phe Trp He V al Pro Ser Leu v!i I 2?° AAC ACG GAG CTC ATC CCA 
255 er ^ Val Ser G1 y *sn Thr Glu Leu He Pro 



397 



SS SII SIS SI? I?S a" S?I as ss as a? ss a? ss SK as 

20 25 

s?i as as as sa is ai as si? 21? sis s?i as as as «■ 
si? ?s as as sk sn a, a; sis as sa ?s as as as sa - 



541 



589 



637 



2S S 22 S?S S SI? Ill S5 a? £ S 2S SK SK S SI? 

as sk sis sis ai ill as is is sk a? ™ si? ~ - « - 

12 0 

S?S S SI S?S S?S S III SIS KS SIS -s - - »; « ~ 733 

135 140 

IS KS III IIS sk III s; SS IS JK as SK sk as ks si? - 
21? SIS 2£ ?IS 21? SK s; »; « - - - ™ ™ ™ « 829 



KS KI J2 ™ SSI S?S ?2S 2?S SK IIS K° 2 CC T 0TC •*» «« 

175 7 9 S Ile Ser Phe Val Lys Thr 

180 1Q5 

Jhr Sa^ A^p A^n Ser HI ?S ^ 3°° ? C AT ° CAG ** T GTG * TC * C * 
190 fq? Val G1 y Tr P As P Met Gin Asn Val He Thr 

200 205 

SIS 2? KI IS? Ill SK Asp S2 K? KS SK SI? SK SIS KS £ 

215 220 
ATC CAC TCT TCT GTC ATC TTG CTC TAC TPT Trr 

U. His S „ j. „ ai „. Le „ J= ™= ™ TCC AAA CAC CAC CCT CXT 

2 3 0 2 3 5 

S JS Su ^ S CTT GGC CTC ACC GGG TAT GAT TTC 

240 a LeU y Leu Thr Gly Tyr Asp Phe 



973 



1021 



1069 



1117 



260 265 
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AAA GAG TTT CCA TCG GGA CTC ATT TCT GTC TCC TAC GAT G AC TGG GAC 1165 
Lys Glu Phe Pro Ser Gly Leu lie Ser Val Ser Tyr Asp Asp Trp Asp 
2 Y 10 275 280 285 

TAC AGC CTG GAG GCG AGA GTG AGG GAC GGC ATT GGC ATC CTA ACC ACC 1213 
Tyr Ser Leu Glu Ala Arg Val Arg Asp Gly He Gly He Leu Thr Thr 
1 290 295 300 

GCT GCA TCT TCT ATG CTG GAG AAG TTC TCC TAC ATC CCC GAG GCC AAG 1261 
Ala Ala Ser Ser Met Leu Glu Lys Phe Ser Tyr He Pro Glu Ala Lys 
305 310 315 

GCC AGC TGC TAC GGG CAG ATG GAG AGG CCA GAG GTC CCG ATG CAC ACC 1309 
Ala Ser Cys Tyr Gly Gin Met Glu Arg Pro Glu Val Pro Met His Thr 
320 325 330 

TTG CAC CCA TTT ATG GTC AAT GTT ACA TGG GAT GGC AAA GAC TTA TCC 1357 
Leu His Pro Phe Met Val Asn Val Thr Trp Asp Gly Lys Asp Leu Ser 
335 340 345 

TTC ACT GAG GAA GGC TAC CAG GTG CAC CCC AGG CTG GTG GTG ATT GTG 1405 
Phe Thr Glu Glu Gly Tyr Gin Val His Pro Arg Leu Val Val He Val 
350 355 360 365 

CTG AAC AAA GAC CGG GAA TGG GAA AAG GTG GGC AAG TGG GAG AAC CAT 1453 
Leu Asn Lys Asp Arg Glu Trp Glu Lys Val Gly Lys Trp Glu Asn His 
370 375 380 

ACG CTG AGC CTG AGG CAC GCC GTG TGG CCC AGG TAC AAG TCC TTC TCC 1501 
Thr Leu Ser Leu Arg His Ala Val Trp Pro Arg Tyr Lys Ser Phe Ser 
385 390 395 

GAC TGT GAG CCG GAT GAC AAC CAT CTC AGC ATC GTC ACC CTG GAG GAG 1549 
Asp Cys Glu Pro Asp Asp Asn His Leu Ser He Val Thr Leu Glu Glu 
400 * 405 410 

GCC CCA TTC GTC ATC GTG GAA GAC ATA GAC CCC CTG ACC GAG ACG TGT 1597 
Ala Pro Phe Val He Val Glu Asp He Asp Pro Leu Thr Glu Thr Cys 
415 420 425 

GTG AGG AAC ACC GTG CCA TGT CGG AAG TTC GTC AAA ATC AAC AAT TCA 1645 
Val Arg Asn Thr Val Pro Cys Arg Lys Phe Val Lys He Asn Asn Ser 
430 435 440 445 

ACC AAT GAG GGG ATG AAT GTG AAG AAA TGC TGC AAG GGG TTC TGC ATT 169 3 

Thr Asn Glu Gly Met Asn Val Lys Lys Cys Cys Lys Gly Phe Cys He 
450 455 460 

GAT ATT CTG AAG AAG CTT TCC AGA ACT GTG AAG TTT ACT TAC GAC CTC 1741 
Asp He Leu Lys Lys Leu Ser Arg Thr Val Lys Phe Thr Tyr Asp Leu 
465 470 475 

TAT CTG GTG ACC AAT GGG AAG CAT GGC AAG AAA GTT AAC AAT GTG TGG 1789 
Tyr Leu Val Thr Asn Gly Lys His Gly Lys Lys Val Asn Asn Val Trp 
480 485 490 

AAT GGA ATG ATC GGT GAA GTG GTC TAT CAA CGG GCA GTC ATG GCA GTT 1837 
Asn Gly Met He Gly Glu Val Val Tyr Gin Arg Ala Val Met Ala Val 
495 500 505 

GGC TCG CTC ACC ATC AAT GAG GAA, CGT TCT GAA GTG GTG GAC TTC TCT 1885 
Gly Ser Leu Thr He Asn Glu Glu Arg Ser Glu Val Val Asp Phe Ser 
510 515 520 525 

GTG CCC TTT GTG GAA ACG GGA ATC AGT GTC ATG GTT TCA AGA AGT AAT 1933 
Val Pro Phe Val Glu Thr Gly He Ser Val Met Val Ser Arg Ser Asn 
530 535 540 
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2029 



2077 



2125 



2173 



GGC ACC GTC TCA CCT TCT GCT TTT CTA GAA CCA TTC AGC CCC TCT GTC 1981 
Gly Thr Val Ser Pro Ser Ala Phe Leu Glu Pro Phe Ser Ala Ser Val 
1 545 550 ^55 

TGG GTG ATG ATG TTT GTG ATG CTG CTC ATT GTT TCT GCC ATA GCT GTT 
Trp Val Met Met Phe Val Met Leu Leu lie Val Ser Ala He Ala Val 
P 560 565 570 

TGG GTC TTG GAT TAC TCC AGC CCT GTT GGA TAC AAC AGA AAC TTA GCC 
Trp Val Leu Asp Tyr Ser Ser Pro Val Gly Tyr Asn Arg Asn Leu Ala 
* 575 580 585 

AAA GGG AAA GCA CCC CAT GGG CCT TCT TTT ACA ATT GGA AAA GCT ATA 
Lys Gly Lys Ala Pro His Gly Pro Ser Phe Thr He Gly Lys Ala lie 
590 595 600 605 

TGG CTT CTT TGG GGC CTG GTG TTC AAT AAC TCC GTG CCT GTC CAG AAT 
Trp Leu Leu Trp Gly Leu Val Phe Asn Asn Ser Val Pro Val Gin Asn 
610 615 620 

CCT AAA GGG ACC ACC AGC AAG ATC ATG GTA TCT GTA TGG GCC TTC TTC 2221 
Pro Lys Gly Thr Thr"" Ser Lys He Met Val Ser Val Trp Ala Phe Phe 
625 630 635 

GCT GTC ATA TTC CTG GCT AGC TAC ACA GCC AAT CTG GCT GCC TTC ATG 2 2 69 

Ala Val He Phe Leu Ala Ser Tyr Thr Ala Asn Leu Ala Ala Phe Met 
640 645 650 

ATC CAA GAG GAA TTT GTG GAC CAA GTG ACC GGC CTC AGT GAC AAA AAG 2317 
He Gin Glu Glu Phe Val Asp Gin Val Thr Gly Leu Ser Asp Lys Lys 
655 660 665 

TTT CAG AGA CCT CAT GAC TAT TCC CCA CCT TTT CGA TTT GGG ACA GTG 2365 
Phe Gin Arg Pro His Asp Tyr Ser Pro Pro Phe Arg Phe Gly Thr Val 
670 675 680 685 

CCT AAT GGA AGC ACG GAG AGA AAC ATT CGG AAT AAC TAT CCC TAC ATG 2413 
Pro Asn Gly Ser Thr Glu Arg Asn He Arg Asn Asn Tyr Pro Tyr Met 
690 695 700 

CAT CAG TAC ATG ACC AAA TTT AAT CAG AAA GGA GTA GAG GAC GCC TTG 2461 
His Gin Tyr Met Thr Lys Phe Asn Gin Lys Gly Val Glu Asp Ala Leu 
705 710 715 

GTC AGC CTG AAA ACG GGG AAG CTG GAC GCT TTC ATC TAC GAT GCC GCA 2 509 

Val Ser Leu Lys Thr Gly Lys Leu Asp Ala Phe lie Tyr Asp Ala Ala 
720 725 730 

GTC TTG AAT TAC AAG GCT GGG AGG GAT GAA GGC TGC AAG CTG GTG ACC 2 5 57 

Val Leu Asn Tyr Lys Ala Gly Arg Asp Glu Gly Cys Lys Leu Val Thr 
735 740 745 

ATC GGG AGT GGG TAC ATC TTT GCC ACC ACC GGT TAT GGA ATT GCC CTT 2 605 

He Gly Ser Gly Tyr He Phe Ala Thr Thr Gly Tyr Gly He Ala Leu 
750 755 760 765 

CAG AAA GGC TCT CCT TGG AAG AGG CAG ATC GAC CTG GCC TTG CTT CAG 2653 
Gin Lys Gly Ser Pro Trp Lys Arg Gin He Asp Leu Ala Leu Leu Gin 
770 775 780 

TTT GTG GGT GAT GGT GAG ATG GAG, GAG CTG GAG ACC CTG TGG CTC ACT 2701 
Phe Val Gly Asp Gly Glu Met Glu Glu Leu Glu Thr Leu Trp Leu Thr 
785 790 795 

GGG ATC TGC CAC AAC GAG AAG AAC GAG GTG ATG AGC AGC CAG CTG GAC 2749 
Gly He Cys His Asn Glu Lys Asn Glu Val Met Ser Ser Gin Leu Asp 
800 805 810 
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ATT GAC AAC ATG GCG GGC GTA TTC TAC ATG CTG GCT GCC GCC ATG GCC 2797 

lie Asp Asn Met Ala Gly Val Phe Tyr Met Leu Ala Ala Ala Met Ala 

815 820 ' 825 

CTT AGC CTC ATC ACC TTC ATC TGG GAG CAC CTC TTC TAC TGG AAG CTG 2845 
Leu Ser Leu lie Thr Phe lie Trp Glu His Leu Phe Tyr Trp Lys Leu 
830 835 840 ' " 845 

CGC TTC TGT TTC ACG GGC GTG TGC TCC GAC CGG CCT GGG TTG CTC TTC 2893 
Arg Phe Cys Phe Thr Gly Val Cys Ser Asp Arg Pro Gly Leu Leu Phe 
850 855 ^ 860 

TCC ATC AGC AGG GGC ATC TAC AGC TGC ATT CAT GGA GTG CAC ATT GAA 2941 
Ser lie Ser Arg Gly He Tyr Ser Cys He His Gly Val His He Glu 
865 870 875 

GAA AAG AAG AAG TCT CCA GAC TTC AAT CTG ACG GGA TCC CAG AGC AAC 2989 
Glu Lys Lys Lys Ser Pro Asp Phe Asn Leu Thr Gly Ser Gin Ser Asn 
880 885 890 

ATG TTA AAA CTC CTC CGG TCA GCC AAA AAC ATT TCC AGC ATG TCC AAC 3037 
Met Leu Lys Leu L^ir Arg Ser Ala Lys Asn He Ser Ser Met Ser Asn 
895 900 905 

ATG AAC TCC TCA AGA ATG GAC TCA CCC AAA AGA GCT GCT GAC TTC ATC 3085 
Met Asn Ser Ser Arg Met Asp Ser Pro Lys Arg Ala Ala Asp Phe He 
910 915 920 925 

CAA AGA GGT TCC CTC ATC ATG GAC ATG GTT TCA GAT AAG GGG AAT TTG 313 3 

Gin Arg Gly Ser Leu He Met Asp Met Val Ser Asp Lys Gly Asn Leu 
930 935 * " 940 

ATG TAC TCA GAC AAC AGG TCC TTT CAG GGG AAA GAG AGC ATT TTT GGA 3181 
Met Tyr Ser Asp Asn Arg Ser Phe Gin Gly Lys Glu Ser He Phe Gly 
945 950 955 

GAC AAC ATG AAC GAA CTC CAA ACA TTT GTG GCC AAC CGG CAG AAG GAT 3229 
Asp Asn Met Asn Glu Leu Gin Thr Phe Val Ala Asn Arg Gin Lys Asp 
960 965 970 

AAC CTC AAT AAC TAT GTA TTC CAG GGA CAA CAT CCT CTT ACT CTC AAT 3277 
Asn Leu Asn Asn Tyr Val Phe Gin Gly Gin His Pro Leu Thr Leu Asn 
975 980 985 

GAG TCC AAC CCT AAC ACG GTG GAG GTG GCC GTG AGC ACA GAA TCC AAA 332 5 

Glu Ser Asn Pro Asn Thr Val Glu Val Ala Val Ser Thr Glu Ser Lys 
990 995 1000 1005 

GCG AAC TCT AGA CCC CGG CAG CTG TGG AAG AAA TCC GTG GAT TCC ATA 3373 
Ala Asn Ser Arg Pro Arg Gin Leu Trp Lys Lys Ser Val Asp Ser He 
1010 1015 " 1020 

CGC CAG GAT TCA CTA TCC CAG AAT CCA GTC TCC CAG AGG GAT GAG GCA 3421 
Arg Gin Asp Ser Leu Ser Gin Asn Pro Val Ser Gin Arg Asp Glu Ala 
1025 1030 1035 

ACA GCA GAG AAT AGG ACC CAC TCC CTA AAG AGC CCT AGG TAT CTT CCA 3469 
Thr Ala Glu Asn Arg Thr His Ser Leu Lys Ser Pro Arg Tyr Leu Pro 
1040 1045 1050 

GAA GAG ATG GCC CAC TCT GAC ATT TCA GAA ACG TCA AAT CGG GCC ACG 3517 
Glu Glu Met Ala His Ser Asp He' Ser Glu Thr Ser Asn Arg Ala Thr 
1055 1060 1065 

TGC CAC AGG GAA CCT GAC AAC AGT AAG AAC CAC AAA ACC AAG GAC AAC 3 5 65 

Cys His Arg Glu Pro Asp Asn Ser Lys Asn His Lys Thr Lys Asp Asn 
1Q 70 1075 1080 ^ 1085 
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TTT AAA AGG TCA GTG GCC TCC AAA TAC CCC AAG GAC TGT AGT GAG GTC 3613 
Phe Lys Arg Ser Val Ala Ser Lys Tyr Pro Lys Asp Cys Ser Glu Val 
1090 1095 ' * 1100 

GAG CGC ACC TAC CTG AAA ACC AAA TCA AGC TCC CCT AGA GAC AAG ATC 3661 
Glu Arg Thr Tyr Leu Lys Thr Lys Ser Ser Ser Pro Arg Asp Lys lie 
1105 mo ins 

TAC ACT ATA GAT GGT GAG AAG GAG CCT GGT TTC CAC TTA GAT CCA CCC 3709 
Tyr Thr He Asp Gly Glu Lys Glu Pro Gly Phe His Leu Asp Pro Pro 
1120 H25 ' H30 

CAG TTT GTT GAA AAT GTG ACC CTG CCC GAG AAC GTG GAC TTC CCG GAC 3757 
Gin Phe Val Glu Asn Val Thr Leu Pro Glu Asn Val Asp Phe Pro Asp 
1135 1140 1145 

CCC TAC CAG GAT CCC AGT GAA AAC TTC CGC AAG GGG GAC TCC ACG CTG 3805 
Pro Tyr Gin Asp Pro Ser Glu Asn Phe Arg Lys Gly Asp Ser Thr Leu 
1150 H55 H60 H65 

CCA ATG AAC CGG AAC CCC TTG CAT AAT GAA GAG GGG CTT TCC AAC AAC 3853 
Pro Met Asn Arg Asn Pro Leu His Asn Glu Glu Gly Leu Ser Asn Asn 
-M.7 0 1175 1180 

GAC CAG TAT AAA CTC TAC TCC AAG CAC TTC ACC TTG AAA GAC AAG GGT 3901 
Asp Gin Tyr Lys Leu Tyr Ser Lys His Phe Thr Leu Lys Asp Lys Glv 
H85 H90 H95 

TCC CCG CAC AGT GAG ACC AGC GAG CGA TAC CGG CAG AAC TCC ACG CAC 3949 
Ser Pro His Ser Glu Thr Ser Glu Arg Tyr Arg Gin Asn Ser Thr His 
1200 1205 1210 

TGC AGA AGC TGC CTT TCC AAC ATG CCC ACC TAT TCA GGC CAC TTC ACC 3997 
Cys Arg Ser Cys Leu Ser Asn Met Pro Thr Tyr Ser Gly His Phe Thr 
12 15 1220 1225 

ATG AGG TCC CCC TTC AAG TGC GAT GCC TGC CTG CGG ATG GGG AAC CTC 4045 

??™ 9 Pr ° PhS LyS CyS AS P Ala Leu Ar 9 Me t Gly Asn Leu 

1230 1235 1240 1245 

TAT GAC ATC GAT GAA GAC CAG ATG CTT CAG GAG ACA GGT AAC CCA GCC 4093 
Tyr Asp He Asp Glu Asp Gin Met Leu Gin Glu Thr Gly Asn Pro Ala 
1250 1255 1260 

ACC GGG GAG CAG GTC TAC CAG CAG GAC TGG GCA CAG AAC AAT GCC CTT 
Thr Gly Glu Gin Val Tyr Gin Gin Asp Trp Ala Gin Asn Asn Ala Leu 
1265 1270 1275 

CAA TTA CAA AAG AAC AAG CTA AGG ATT AGC CGT CAG CAT TCC TAC GAT 
Gin Leu Gin Lys Asn Lys Leu Arg He Ser Arg Gin His Ser Tyr Asp 
1280 1285 1290 

^ ?*? ? AC ^ ? CT AGG GAG CTA GAC CTT AGC AGG CCC TCC CGG 

Ser i 
1305 

AGC ATA AGC CTC AAG GAC AGG GAA CGG CTT CTG GAG GGA AAT TTT TAC 

T^n LyS ASP Arg Glu Arg Leu Leu Glu Gly Asn Phe Tyr 

1J1U 1 315 1320 1325 

GGC AGC CTG TTT AGT GTC CCC TCA AGC AAA CTC TCG GGG AAA AAA AGC 
Gly Ser Leu Phe Ser Val Pro Ser, Ser Lys Leu Ser Gly Lys Lys Ser 
1330 1335 1340 

TCC CTT TTC CCC CAA GGT CTG GAG GAC AGC AAG AGG AGC AAG TCT CTC 
Ser Leu Phe Pro Gin Gly Leu Glu Asp Ser Lys Arg Ser Lys Ser Leu 
"45 1350 ~ 1355 



tT V, r ~™ x U1A UAC GTT AGC AGG CCC TCC CGG 

ASP LyS Pr ° Arg Giu Leu As P Lei ^ s " Arg Pro Ser Arq 
12 95 1300 ' y 



4141 



4189 



4237 



4285 



4333 



4381 
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TTG CCA GAC CAC ACC TCC GAT AAC CCT TTC CTC CAC TCC CAC AGG GAT 4429 

Leu Pro Asp His Thr Ser Asp Asn Pro Phe Leu His Ser His Arg Asp 
1360 1365 1370 

GAC CAA CGC TTG GTT ATT GGG AGA TGC CCC TCG GAC CCT TAC AAA CAC 447 7 

Asp Gin Arg Leu Vai He Gly Arg Cys Pro Ser Asp Pro Tyr Lys His 
1375 1380 1385 

TCG TTG CCA TCC CAG GCG GTG AAT GAC AGC TAT CTT CGG TCG TCC TTG 452 5 

Ser Leu Pro Ser Gin Ala Val Asn Asp Ser Tyr Leu Arg Ser Ser Leu 
1390 1395 1400 1405 

AGG TCA ACG GCA TCG TAC TGT TCC AGG GAC AGT CGG GGC CAC AAT GAT 4573 
Arg Ser Thr Ala Ser Tyr Cys Ser Arg Asp Ser Arg Gly His Asn Asp 
1410 1415 1420 

GTG TAT ATT TCG GAG CAT GTT ATG CCT TAT GCT GCA AAT AAG AAT AAT 4 621 

Val Tyr He Ser Glu His Val Met Pro Tyr Ala Ala Asn Lys Asn Asn 
1425 1430 1435 

ATG TAC TCT ACC CCC AGG GTT TTA AAT TCC TGC AGC AAT AGA CGC GTG 4669 
Met Tyr Ser Thr Pro* Arg Val Leu Asn Ser Cys Ser Asn Arg Arg Val. 
1440 1445 1450 

TAC AAG GAA ATG CCT AGT ATC GAA TCT GAT GTT TAAAAATCTT CCATTAATGT 4722 
Tyr Lys Glu Met Pro Ser He Glu Ser Asp Val 

1455 1460 146 

TTTATCTATA GGGAAATACA CGTAATGGCC AATGTTCTGG AGGGTAAATG TTGGATGTCC 4782 
AATAGTGCCC TGCTAAGAGG AAGGAG 4808 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1464 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Gly Arg Val Gly Tyr Trp Thr Leu Leu Val Leu Pro Ala Leu Leu 
1 ' 5 10 15 

Val Trp Arg Gly Pro Ala Pro Ser Ala Ala Ala Glu Lys Gly Pro Pro 
20 25 30 

Ala Leu Asn He Ala Val Met Leu Gly His Ser His Asp Val Thr Glu 
35 40 45 

Arg Glu Leu Arg Thr Leu Trp Gly Pro Glu Gin Ala Ala Gly Leu Pro 
50 5 5 60 

Leu Asp Val Asn Val Val Ala Leu Leu Met Asn Arg Thr Asp Pro Lys 
65 70 75 80 

Ser Leu He Thr His Val Cys Asp Leu Met Ser Gly Ala Arg He His 
85 ' , 90 95 

Gly Leu Val Phe Gly Asp Asp Thr Asp Gin Glu Ala Val Ala Gin Met 
100 ~ ' 105 110 

Leu Asp Phe He Ser Ser His Thr Phe Val Pro He Leu Gly He His 
115 120 125 
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Cly Cly Ala Ser Met Xle Met Ala Asp Ly3 Asp ^ Thr ^ ^ ^ 
PJ- Oln PHe Cly Ala s« xi. Cln Gln cln Ala Thr val Met Leu Ly3 
He Met Cln Asp Tyr Asp Trp His Val P he Ser Leu Val Thr Thr 
PHe P,o Cly Tyr Arg Glu phe ^ ^ ^ ^ ^ ^ ^ ^ 

AS " IS V ^ T - ASP SS T ^ Thr 

205 

210 ASP L *« Jf| Val Cln Leu Lys Lys Ile His Ser 

220 

Ser val Xle Leu Leu Tyr C ys Ser Lys Asp clu Ala Val Leu lie Leu 

235 240 

Ser Clu Ala Arg Ser Leu Cl y Leu Thr Cly T y r Asp Phe Pne Trp He 

Val Pro ser Leu Val Ser Cl y Asn Thr Clu Leu He Pro Lys IH Phe 

Pro ser Cly Leu He Ser Val Ser Tyr Asp Asp Tr p Asp Tyr Ser Leu 

280 2Q5 

Clu Ala Arg Val Arg Asp Cly lie Cly xle Leu Thr Thr Ala Ala Ser 

300 

|er Met Leu Clu L y s Phe Ser T y r Xle Pro Clu Ala Lys Ala Ser C y s 

315 320 
Tyr Cly Oln Met Clu Arg Pro Clu Val Pro Met His Tnr Leu His Pro 

Phe Met val Asn Val Thr Trp Asp cly L ys Asp Leu Ser Phe Th^ clu 

Clu Cly Tyr Cln Val His Pro Arg Leu Val Val Xle Val Leu Asn Lys 

ASP Arg Clu Trp Clu Lys Val cly Lys Trp clu Asn His Thr Leu Ser 

J/b 380 

385 A ^ A ^ ^ Lys Ser Phe Ser Asp Cys Clu 

395 400 
P- Asp Asp Asn His Leu Ser Xle Val Thr Leu Clu clu Ala Pro Phe 

Val Xle Val Clu Asp Ile Asp prQ ^ ^ ^ ^ ^ ^ ^ ^ 
Thr val Pro c y s Arg L y s Phe Val Lys Xle Asn Asn Ser Thr Asn Clu 

Cly Met Asn Val Lys Lys Cys c ys L y s Cly Phe Cys ill Agp xle Leu 

^ b , 460 
jy g - Ly. Leu ser Arg Thr Val Lys P he Thr Tyr Asp Leu T y r Leu Val 



480 



116 

Thr Asn Gly Lys His Gly Lys Lys Val Asn Asn Va 1 Trp Asn Gly Met 

490 495 

He Gly GXu Val Val Tyr Gin Arg Ala Val Met Ala Val Gly Ser Leu 

505 510 



Thr lie Asn Glu Glu Arg Ser Glu Val Val Asp Phe Ser Val Pro Phe 

b2 ° 525 
Val GXu Thr Gly He Ser Val Met Val Ser Arg Ser Asn Gly Thr Val 

b35 540 
Ser Pro Ser Ala Phe Leu Glu Pro Phe Ser Ala Ser Val Trp Val Met 

555 5 60 

Met Phe Val Met Leu Leu He Val Ser Ala He Ala Val Trp Val Leu 
565 "0 575 

Asp Tyr Ser Ser Pro Val Gly Tyr Asn Arg Asn Leu Ala Lys Gly Lys 

585 590 

Ala Pro His Gly Pro Ser Phe Thr He Gly Lys Ala He Trp Leu Leu 



605 



Trp Gly Leu Val Phe Asn Asn Ser Val Pro Val Gin Asn Pro Lys Gly 



620 



Thr Thr Ser Lys lie Met Val Ser Val Trp Ala Phe Phe Ala Val He 

635 640 
Phe Leu Ala Ser Tyr Thr Ala Asn Leu Ala Ala Phe Met lie Gin Glu 

Glu Phe Val Asp Gin Val Thr Gly Leu Ser Asp Lys Lys Phe Gin Arg 

665 670 

Pro His Asp Tyr Ser Pro Pro Phe Arg Phe Gly Thr Val Pro Asn Gly 

680 6Q5 

Ser Thr Glu Arg Asn He Arg Asn Asn Tyr Pro Tyr Met His Gin Tyr 

695 700 
Met Thr Lys Phe Asn Gin Lys Gly Val Glu Asp Ala Leu Val Ser Leu 

710 7 15 72 o 

Lys Thr Gly Lys Leu Asp Ala Phe He Tyr Asp Ala Ala Val Leu Asn 

730 735 

Tyr Lys Ala Gly Arg Asp Glu Gly Cys Lys Leu Val Thr He Gly Ser 

745 750 

Gly Tyr lie Phe Ala Thr Thr Gly Tyr Gly He Ala Leu Gin Lys Gly 

/60 ?65 

Ser Pro Trp Lys Arg Gin lie Asp Leu Ala Leu Leu Gin Phe Val Gly 

775 780 
Asp Gly Glu Met Glu Glu Leu Glu Thr Leu Trp Leu Thr Gly He Cys 

795 800 
His Asn Glu Lys Asn Glu Val Met Ser Ser Gin Leu Asp He Asp Asn 



805 



810 



815 



Met Ala Gly Val Phe Tyr Met Leu Ala Ala Ala Met Ala Leu Ser Leu 

825 830 
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II. TIT Ph. He x rp olu His phe Tyr Trp 

8 4 5 

S j ely val c y . ser J., Arg Pro My ^ ^ 

8 6 0 

JJJ „y Ile Iyr ser Cy 3 „. Hi , Gly w ^ l ^ 

r „ 880 

y P " ssp HI »" L - T - «» Jj; «n ... »,„ „. t l.» ly . 

890 895 
yUb 910 

- «.t flop s. r Pro Ly . }n „. „. Aap ph< ^ ^ ^ 

S.r X1 . „. t A . p „. t ftsp ^ Leu H.t Tyr S „ 

JD 940 

» 4 .P -» *r g s. r _ ?he „„ Gly Lys „„ ser fljp ^ ^ 

960 

Asn Glu Leu Gin Thr Phe Val Ala Asn Arc cm r 

965 " Gln L V S Asp Asn Leu Asn 

Asn Tyr Val Phe Gin Gly Gin His Pro Leu Thr- r * 

980 * Leu Thr Leu Asn Glu Ser Asn 

ob 990 

P " *" 3. ^ .lu ser L y , „. A ,„ s „ 

■ LUUU 1005 

Arg Pro Arg Gin Leu Trp Lys Lvs « BP v , » 

1Q 10 P ^XS Ser Val As P Ser He Arg Gin Asp 

" 1020 V 

f- L~ nn A ,„ P „ val .„ „ n ^ au ^ ^ 

Asn Arg Thr His Ser Leu Lvs <?o^ o 

1045 ^ Pr ° tnfn Tyr LSU Pr ° Glu Glu Met 



1050 1055 



Ala His Ser Asp H e Ser- n„ 

1060 G1U Thr ?« *rg Ala Thr Cys His Arg 

1065 1070 

j;?s a - s - «*■ «v *>* >*• ».p Ph. Ly3 Ar9 

1085 

S " IE,"' L "" T - ».p Cy S set „l„ Val C lu Srg T „ r 

1100 

J^L- h ya Thc Ly , ? „ .„ p „ y> ^ r ^ ^ 

1120 

Asp Gly Glu Lys Glu Pro Gly Phe m« r » ' " 

1125 y ^ HlS ^f" As P Pro Pro Gin Phe Val 

1135 

01u vai ijv -° «« »» vji « P p h . Pro A . p pro Tyr oln 

1150 

" P " "Is 01 " *» ^ 1%°* »-P Thr Leu P„ „e t ».„ 

' LbU 1165 
~, J. ?o P« L.„ „. As „ . ; „ 5Glu Gly ^ s ^ flsn ^ p 
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Lys Leu Tyr Ser Lys His Phe Thr Leu Lys Asp Lys Gly Ser Pro His 
H85 1190 H95 ' 1200 

Ser Glu Thr Ser Glu Arg Tyr Arg Gin Asn Ser Thr His Cys Arg Ser 
1205 1210 1215 

Cys Leu Ser Asn Met Pro Thr Tyr Ser Gly His Phe Thr Met Arg Ser 
1220 1225 1230 

Pro Phe Lys Cys Asp Ala Cys Leu Arg Met Gly Asn Leu Tyr Asp He 
1235 1240 ' 1245 

Asp Glu Asp Gin Met Leu Gin Glu Thr Gly Asn Pro Ala Thr Gly Glu 
1250 1255 1260 

Gin Val Tyr Gin Gin Asp Trp Ala Gin Asn Asn Ala Leu Gin Leu Gin 
1265 1270 1275 1280 

Lys Asn Lys Leu Arg He Ser Arg Gin His Ser Tyr Asp Asn lie Val 
1285 1290 " 1295 

Asp Lys Pro Arg GlVLeu Asp Leu Ser Arg Pro Ser Arg Ser lie Ser 
1300 1305 1310 

Leu Lys Asp Arg Glu Arg Leu Leu Glu Gly Asn Phe Tyr Gly Ser Leu 
1315 1320 1325 

Phe Ser Val Pro Ser Ser Lys Leu Ser Gly Lys Lys Ser Ser Leu Phe 
1330 1335 1340 

Pro Gin Gly Leu Glu Asp Ser Lys Arg Ser Lys Ser Leu Leu Pro Asp 
1345 1350 1355 1360 

His Thr Ser Asp Asn Pro Phe Leu His Ser His Arg Asp Asp Gin Arg 
1365 1370 " 1375 

Leu Val He Gly Arg Cys Pro Ser Asp Pro Tyr Lys His Ser Leu Pro 
1380 1385 1390 

Ser Gin Ala Val Asn Asp Ser Tyr Leu Arg Ser Ser Leu Arg Ser Thr 
1395 1400 1405 

Ala Ser Tyr Cys Ser Arg Asp Ser Arg Gly His Asn Asp Val Tyr He 
1410 1415 1420 

Ser Glu His Val Met Pro Tyr Ala Ala Asn Lys Asn Asn Met Tyr Ser 
14 25 1430 1435 1440 

Thr Pro Arg Val Leu Asn Ser Cys Ser Asn Arg Arg Val Tyr Lys Glu 
1445 1450 " 1455 

Met Pro Ser He Glu Ser Asp Val 
1460 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 74 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



E S £ S S S S "° s 5? S K SJ 2 S? S 
SJ E 5 S.K 2£ E S E E S =4° S S E E 

35 40 

E SS E J5 S « «S SI? S2 S r S" occ CflC °» - 

45 V val Ala lie Lys Asp Ala His Glu Lys 

SS SS E SS E S E 51? S3 E E GT ? c " ™ «» «* 

60 C Val V ?J Pro Ar 3 Val Glu Leu Val Ala 

70 

s E E E Sip s - s S « « «- jp „ „ „ 

= = E 2; e a s - e e e s « - s - 

100 ~ ^ 

s a s e e a s e e s s s s ss s e 

e e s a e S E EE E E s s s s 5 

130 135 



74 
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(xi, SEQUENCE DESCRIPTION: SEQ ID NO:12: 

CGAGGGAGGC GGCCGGCGCG GACTPTPTtp r-^ 

GACTCTCTTC GCGGGCGCAG CGCCCCTTCC CCCTCGGACC 60 

CTCCGGTGGA CATG 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 5538 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 210.. 4664 

(xi) SEQUENCE*" DESCRIPTION: SEQ ID NO- 13- 
TTGAATTTGC ATCTCTTCAA GACACAAGAT TAAAACAAAA TTTACGCTAA ATTGGATTTT 
AAATTATCTT CCGTTCATTT ATCCTTCGTC TTTCTTATGT GGATATGCAA GCGAGAAGAA 
GGGACTGGAC ATTCCCAACA TGCTCACTCC CTTAATCTGT CCGTCTAGAG GTTTGGCTTC 
TACAAACCAA GGGAGTCGAC GAGTTGAAG ATG AAG CCC AGA GCG GAG TGC TGT 

Met Lys Pro Arg Ala Glu Cys Cys 



60 
120 
180 
233 

281 

329 

377 

425 



473 



521 



569 



617 
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AAG GAT GAA TCC TCC ATG TTC TTC CAG TTT GGC CCA TCA ATT GAA CAG 665 
Lys Asp Glu Ser Ser Met Phe Phe Gin Phe Gly Pro Ser He Glu Gin 
140 145 150 

CAA GCT TCC GTA ATG CTC AAC ATC ATG GAA GAA TAT GAC TGG TAC ATC 713 
Gin Ala Ser Val Met Leu Asn He Met Glu Glu Tyr Asp Trp Tyr He 
155 160 i 6 5 

TTT TCT ATC GTC ACC ACC TAT TTC CCT GGC TAC CAG GAC TTT GTA AAC 7 61 

Phe Ser He Val Thr Thr Tyr Phe Pro Gly Tyr Gin Asp Phe Val Asn 
170 175 18Q 

AAG ATC CGC AGC ACC ATT GAG AAT AGC TTT GTG GGC TGG GAG CTA GAG 809 
Lys He Arg Ser Thr He Glu Asn Ser Phe Val Gly Trp Glu Leu Glu 
185 190 195 200 

GAG GTC CTC CTA CTG GAC ATG TCC CTG GAC GAT GGA GAT TCT AAG ATC 857 
Glu Val Leu Leu Leu Asp Met Ser Leu Asp Asp Gly Asp Ser Lys He 
205 210 215 

CAG AAT CAG CTC AAG AAA CTT CAA AGC CCC ATC ATT CTT CTT TAC TGT 
Gin Asn Gin Leu Lys Lys Leu Gin Ser Pro He He Leu Leu Tyr Cys 
220 225 230 

ACC AAG GAA GAA GCC ACC TAC ATC TTT GAA GTG GCC AAC TCA GTA GGG 953 
Thr Lys Glu Glu Ala Thr Tyr He Phe Glu Val Ala Asn Ser Val Gly 
235 240 245 

CTG ACT GGC TAT GGC TAC ACG TGG ATC GTG CCC AGT CTG GTG GCA GGG 
Leu Thr Gly Tyr Gly Tyr Thr Trp He Val Pro Ser Leu Val Ala Gly 
250 255 260 

GAT ACA GAC ACA GTG CCT GCG GAG TTC CCC ACT GGG CTC ATC TCT GTA 
Asp Thr Asp Thr Val Pro Ala Glu Phe Pro Thr Gly Leu He Ser Val 
265 270 275 ^ 280 

TCA TAT GAT GAA TGG GAC TAT GGC CTC CCC CCC AGA GTG AGA GAT GGA 109 7 

Ser Tyr Asp Glu Trp Asp Tyr Gly Leu Pro Pro Arg Val Arg Asp Gly 
285 290 295 



905 



1001 



1049 



1145 



1241 



ATT GCC ATA ATC ACC ACT GCT GCT TCT GAC ATG CTG TCT GAG CAC AGC 
He Ala He He Thr Thr Ala Ala Ser Asp Met Leu Ser Glu His Ser 
300 305 310 

TTC ATC CCT GAG CCC AAA AGC AGT TGT TAC AAC ACC CAC GAG AAG AGA 1193 
Phe He Pro Glu Pro Lys Ser Ser Cys Tyr Asn Thr His Glu Lys Arq 
315 320 325 

ATC TAC CAG TCC AAT ATG CTA AAT AGG TAT CTG ATC AAT GTC ACT TTT 
He Tyr Gin Ser Asn Met Leu Asn Arg Tyr Leu He Asn Val Thr Phe 
330 335 340 

GAG GGG AGG AAT TTG TCC TTC AGT GAA GAT GGC TAC CAG ATG CAC CCG 1289 
Glu Gly Arg Asn Leu Ser Phe Ser Glu Asp Gly Tyr Gin Met His Pro 
345 3 50 355 360 

AAA CTG GTG ATA ATT CTT CTG AAC AAG GAG AGG AAG TGG GAA AGG GTG 133 7 

Lys Leu Val He He Leu Leu Asn Lys Glu Arg Lys Trp Glu Arg Val 
365 370 375 

GGG AAG TGG AAA GAC AAG TCC CTG CAG ATG AAG TAC TAT GTG TGG CCC 13 85 

Gly Lys Trp Lys Asp Lys Ser Leu 'Gin Met Lys Tyr Tyr Val Trp Pro 
3 80 385 * 390 

CGA ATG TGT CCA GAG ACT GAA GAG CAG GAG GAT GAC CAT CTG AGC ATT 143 3 

Arg Met Cys Pro Glu Thr Glu Glu Gin Glu Asp Asp His Leu Ser He 
395 400 405 
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GTG ACC CTG GAG GAG GCA CCA TTT GTC ATT GTG GAA AGT GTG GAC CCT 1481 
Val Thr Leu Glu Glu Ala Pro Phe Val lie Val Glu Ser Val Asp Pro 
410 415 420 

CTG AGT GGA ACC TGC ATG AGG AAC ACA GTC CCC TGC CAA AAA CGC ATA 1529 
Leu Ser Gly Thr Cys Met Arg Asn Thr Val Pro Cys Gin Lys Arg lie 
425 430 435 440 

GTC ACT GAG AAT AAA ACA GAC GAG GAG CCG GGT TAC ATC AAA AAA TGC 1577 
Val Thr Glu Asn Lys Thr Asp Glu Glu Pro Gly Tyr lie Lys Lys Cys 
445 450 455 

TGC AAG GGG TTC TGT ATT GAC ATC CTT AAG AAA ATT TCT AAA TCT GTG 1625 
Cys Lys Gly Phe Cys lie Asp lie Leu Lys Lys lie Ser Lys Ser Val 
460 ~ 465 470 

AAG TTC ACC TAT GAC CTT TAC CTG GTT ACC AAT GGC AAG CAT GGG AAG 167 3 

Lys Phe Thr Tyr Asp Leu Tyr Leu Val Thr Asn Gly Lys His Gly Lys 
475 480 485 

AAA ATC AAT GGA ACC TGG AAT GGT ATG ATT GGA GAG GTG GTC ATG AAG 1721 
Lys He Asn Gly Thr* Trp Asn Gly Met He Gly Glu Val Val Met Lys 
490 495 500 

AGG GCC TAC ATG GCA GTG GGC TCA CTC ACC ATC AAT GAG GAA CGA TCG 17 69 

Arq Ala Tyr Met Ala Val Gly Ser Leu Thr He Asn Glu Glu Arg Ser 
505 510 515 520 

GAG GTG GTC GAC TTC TCT GTG CCC TTC ATA GAG ACA GGC ATC AGT GTC 1817 
Glu Val Val Asp Phe Ser Val Pro Phe He Glu Thr Gly He Ser Val 
525 530 535 

ATG GTG TCA CGC AGC AAT GGG ACT GTC TCA CCT TCT GCC TTC TTA GAG 186 5 

Met Val Ser Arg Ser Asn Gly Thr Val Ser Pro Ser Ala Phe Leu Glu 
540 545 550 

CCA TTC AGC GCT GAC GTA TGG GTG ATG ATG TTT GTG ATG CTG CTC ATC 1913 
Pro Phe Ser Ala Asp Val Trp Val Met Met Phe Val Met Leu Leu He 
555 560 565 

GTC TCA GCC GTG GCT GTC TTT GTC TTT GAG TAC TTC AGC CCT GTG GGT 19 61 

Val Ser Ala Val Ala Val Phe Val Phe Glu Tyr Phe Ser Pro Val Gly 
570 575 580 

TAT AAC AGG TGC CTC GCT GAT GGC AGA GAG CCT GGT GGA CCC TCT TTC 2009 
Tyr Asn Arg Cys Leu Ala Asp Gly Arg Glu Pro Gly Gly Pro Ser Phe 
585 590 ~ "* 595 600 

ACC ATC GGC AAA GCT ATT TGG TTG CTC TGG GGT CTG GTG TTT AAC AAC 2057 
Thr He Gly Lys Ala He Trp Leu Leu Trp Gly Leu Val Phe Asn Asn 
605 610 615 

TCC GTA CCT GTG CAG AAC CCA AAG GGG ACC ACC TCC AAG ATC ATG GTG 2105 
Ser Val Pro Val Gin Asn Pro Lys Gly Thr Thr Ser Lys He Met Val 
620 625 630 

TCA GTG TGG GCC TTC TTT GCT GTC ATC TTC CTG GCC AGC TAC ACT GCC 2153 
Ser Val Trp Ala Phe Phe Ala Val He Phe Leu Ala Ser Tyr Thr Ala 
635 640 645 



AAC TTA GCT GCC TTC ATG ATC CAA GAG GAA TAT GTG GAC CAG GTT TCT 
Asn Leu Ala Ala Phe Met He Gln'ciu Glu Tyr Val Asp Gin Val Ser 
650 655 660 



2201 



GGC CTG AGC GAC AAA AAG TTC CAG AGA CCT AAT GAC TTC TCA CCC CCT 2249 
Gly Leu Ser Asp Lys Lys Phe Gin Arg Pro Asn Asp Phe Ser Pro Pro 
665 670 675 680 
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TTC CGC TTT GGG ACC GTG CCC AAC GGC AGC ACA GAG AGA AAT ATT CGC 2297 
Phe Arg Phe Gly Thr Val Pro Asn Gly Ser Thr Glu Arg Asn He Arg 
685 690 695 

AAT AAC TAT GCA GAA ATG CAT GCC TAC ATG GGA AAG TTC AAC CAG AGG 2345 
Asn Asn Tyr Ala Glu Met His Ala Tyr Met Gly Lys Phe Asn Gin Arg 
700 705 710 

GGT GTA GAT GAT GCA TTG CTC TCC CTG AAA ACA GGG AAA CTG GAT GCC 2 393 

Gly Val Asp Asp Ala Leu Leu Ser Leu Lys Thr Gly Lys Leu Asp Ala 
715 *" 720 725 

TTC ATC TAT GAT GCA GCA GTG CTG AAC TAT ATG GCA GGC AGA GAT GAA 2441 
Phe He Tyr Asp Ala Ala Val Leu Asn Tyr Met Ala Gly Arg Asp Glu 
730 ' 735 740 

GGC TGC AAG CTG GTG ACC ATT GGC AGT GGG AAG GTC TTT GCT TCC ACT 2489 
Glv Cys Lys Leu Val Thr He Gly Ser Gly Lys Val Phe Ala Ser Thr 
745 750 755 760 

GGC TAT GGC ATT GCC ATC CAA AAA GAT TCT GGG TGG AAG CGC CAG GTG 2537 
Glv Tyr Gly He AlV He Gin Lys Asp Ser Gly Trp Lys Arg Gin Val 
765 770 775 

GAC CTT GCT ATC CTG CAG CTC TTT GGA GAT GGG GAG ATG GAA GAA CTG 2585 
Asp Leu Ala He Leu Gin Leu Phe Gly Asp Gly Glu Met Glu Glu Leu 
780 785 790 

GAA GCT CTC TGG CTC ACT GGC ATT TGT CAC AAT GAG AAG AAT GAG GTC 2 633 

Glu Ala Leu Trp Leu Thr Gly He Cys His Asn Glu Lys Asn Glu Val 
795 800 805 

ATG AGC AGC CAG CTG GAC ATT GAC AAC ATG GCA GGG GTC TTC TAC ATG 2681 
Met Ser Ser Gin Leu Asp He Asp Asn Met Ala Gly Val Phe Tyr Met 
810 815 820 

TTG GGG GCG GCC ATG GCT CTC AGC CTC ATC ACC TTC ATC TGC GAA CAC 2 729 

Leu Gly Ala Ala Met Ala Leu Ser Leu He Thr Phe He Cys Glu His 
825 830 835 840 

CTT TTC TAT TGG CAG TTC CGA CAT TGC TTT ATG GGT GTC TGT TCT GGC 2777 
Leu Phe Tyr Trp Gin Phe Arg His Cys Phe Met Gly Val Cys Ser Gly 
845 850 855 

AAG CCT GGC ATG GTC TTC TCC ATC AGC AGA GGT ATC TAC AGC TGC ATC 2825 
Lys Pro Gly Met Val Phe Ser He Ser Arg Gly He Tyr Ser Cys He 
860 865 870 

CAT GGG GTG GCG ATC GAG GAG CGC CAG TCT GTA ATG AAC TCC CCC ACC 287 3 

His Gly Val Ala He Glu Glu Arg Gin Ser Val Met Asn Ser Pro Thr 
875 880 885 

GCA ACC ATG AAC AAC ACA CAC TCC AAC ATC CTG CGC CTG CTG CGC ACG 2921 
Ala Thr Met Asn Asn Thr His Ser Asn He Leu Arg Leu Leu Arg Thr 
890 895 900 

GCC AAG AAC ATG GCT AAC CTG TCT GGT GTG AAT GGC TCA CCG CAG AGC 29 69 

Ala Lys Asn Met Ala Asn Leu Ser Gly Val Asn Gly Ser Pro Gin Ser 
905 910 915 920 

GCC CTG GAC TTC ATC CGA CGG GAG, TCA TCC GTC TAT GAC ATC TCA GAG 3017 
Ala Leu Asp Phe He Arg Arg Glu Ser Ser Val Tyr Asp He Ser Glu 
925 930 935 

CAC CGC CGC AGC TTC ACG CAT TCT GAC TGC AAA TCC TAC AAC AAC CCG 3065 
His Arg Arg Ser Phe Thr His Ser Asp Cys Lys Ser Tyr Asn Asn Pro 
940 945 950 



• * 
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CCC TGT GAG GAG AAC CTC TTC AGT GAC TAC ATC AGT GAG GTA GAG AGA 3113 

Pro Cys Glu Glu Asn Leu Phe Ser Asp Tyr He Ser Glu Val Glu Arg 

955 960 965 

ACG TTC GGG AAC CTG CAG CTG AAG GAC AGC AAC GTG TAC CAA GAT CAC 3161 

Thr Phe Gly Asn Leu Gin Leu Lys Asp Ser Asn Val Tyr Gin Asp His 

970 "* 975 980 



3209 



TAC CAC CAT CAC CAC CGG CCC CAT AGT ATT GGC AGT GCC AGC TCC ATC 
Tyr His His His His Arg Pro His Ser He Gly Ser Ala Ser Ser lie 
985 990 995 1000 

GAT GGG CTC TAC GAC TGT GAC AAC CCA CCC TTC ACC ACC CAG TCC AGG 3257 
Asp Gly Leu Tyr Asp Cys Asp Asn Pro Pro Phe Thr Thr Gin Ser Arg 
1005 1010 1015 

TCC ATC AGC AAG AAG CCC CTG GAC ATC GGC CTC CCC TCC TCC AAG CAC 3305 
Ser lie Ser Lys Lys Pro Leu Asp He Gly Leu Pro Ser Ser Lys His 
1020 1025 1030 

AGC CAG CTC AGT GAC CTG TAC GGC AAA TTC TCC TTC AAG AGC GAC CGC 3353 
Ser Gin Leu Ser Asp^Leu Tyr Gly Lys Phe Ser Phe Lys Ser Asp Arg 
1035 1040 1045 

TAC AGT GGC CAC GAC GAC TTG ATC CGC TCC GAT GTC TCT GAC ATC TCA 3401 
Tyr Ser Gly His Asp Asp Leu lie Arg Ser Asp Val Ser Asp lie Ser 
1050 1055 1060 

ACC CAC ACC GTC ACC TAT GGG AAC ATC GAG GGC AAT GCC GCC AAG AGG 3449 
Thr His Thr Val Thr Tyr Gly Asn lie Glu Gly Asn Ala Ala Lys Arg 
1065 1070 1075 1080 

CGT AAG CAG CAA TAT AAG GAC AGC CTG AAG AAG CGG CCT GCC TCG GCC 3497 
Arq Lys Gin Gin Tyr Lys Asp Ser Leu Lys Lys Arg Pro Ala Ser Ala 
1085 1090 1095 

AAG TCC CGC AGG GAG TTT GAC GAG ATC GAG CTG GCC TAC CGT CGC CGA 3545 
Lys Ser Arg Arg Glu Phe Asp Glu He Glu Leu Ala Tyr Arg Arg Arg 
1100 *" H05 lHO 

CCG CCC CGC TCC CCT GAC CAC AAG CGC TAC TTC AGG GAC AAG GAA GGG 3593 
Pro Pro Arg Ser Pro Asp His Lys Arg Tyr Phe Arg Asp Lys Glu Gly 
1115 H20 H25 

CTA CGG GAC TTC TAC CTG GAC CAG TTC CGA ACA AAG GAG AAC TCA CCC 3 641 

Leu Arg Asp Phe Tyr Leu Asp Gin Phe Arg Thr Lys Glu Asn Ser Pro 
1130 " H35 H40 

CAC TGG GAG CAC GTA GAC CTG ACC GAC ATC TAC AAG GAG CGG AGT GAT 3 689 

His Trp Glu His Val Asp Leu Thr Asp lie Tyr Lys Glu Arg Ser Asp 
1145 1150 1155 1160 

GAC TTT AAG CGC GAC TCC ATC AGC GGA GGA GGG CCC TGT ACC AAC AGG 3737 
Asp Phe Lys Arg Asp Ser He Ser Gly Gly Gly Pro Cys Thr Asn Arg 
1165 H70 1175 

TCT CAC ATC AAG CAC GGG ACG GGC GAC AAA CAC GGC GTG GTC AGC GGG 3 785 

Ser His He Lys His Gly Thr Gly Asp Lys His Gly Val Val Ser Gly 
1180 H85 H90 

GTA CCT GCA CCT TGG GAG AAG AAC , CTG ACC AAC GTG GAG TGG GAG GAC 3833 
Val Pro Ala Pro Trp Glu Lys Asn Leu Thr Asn Val Glu Trp Glu Asp 
1195 " 1200 1205 

CGG TCC GGG GGC AAC TTC TGC CGC AGC TGT CCC TCC AAG CTG CAC AAC 3881 
Arg Ser Gly Gly Asn Phe Cys Arg Ser Cys Pro Ser Lys Leu His Asn 
1210 J 1215 1220 
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TAC TCC ACG ACG GTG ACG GGT CAG AAC TCG GGC AGG CAG GCG TGC ATC 3929 

Tyr Ser Thr Thr Val Thr Gly Gin Asn Ser Gly Arg Gin Ala Cys lie 
1225 1230 1235 1240 

CGG TGT GAG GCT TGC AAG AAA GCA GGC AAC CTG TAT GAC ATC AGT GAG 3977 
Arg Cys Glu Ala Cys Lys Lys Ala Gly Asn Leu Tyr Asp lie Ser Glu 
1245 1250 1255 

GAC AAC TCC CTG CAG GAA CTG GAC CAG CCG GCT GCC CCA GTG GCG GTG 402 5 

Asp Asn Ser Leu Gin Glu Leu Asp Gin Pro Ala Ala Pro Val Ala Val 
1260 1265 1270 

ACG TCA AAC GCC TCC ACC ACT AAG TAC CCT CAG AGC CCG ACT AAT TCC 4073 
Thr Ser Asn Ala Ser Thr Thr Lys Tyr Pro Gin Ser Pro Thr Asn Ser 
1275 1280 1285 

AAG GCC CAG AAG AAG AAC CGG AAC AAA CTG CGC CGG CAG CAC TCC TAC 4121 
Lys Ala Gin Lys Lys Asn Arg Asn Lys Leu Arg Arg Gin His Ser Tyr 
1290 " " 1295 1300 

GAC ACC TTC GTG GAC CTG CAG AAG GAA GAA GCC GCC CTG GCC CCG CGC 4169 
Asp Thr Phe Val AspTLeu Gin Lys Glu Glu Ala Ala Leu Ala Pro Arg 
1305 1310 1315 1320 

AGC GTA AGC CTG AAA GAC AAG GGC CGA TTC ATG GAT GGG AGC CCC TAC 4217 
Ser Val Ser Leu Lys Asp Lys Gly Arg Phe Met Asp Gly Ser Pro Tyr 
1325 1330 1335 

GCC CAC ATG TTT GAG ATG TCA GCT GGC GAG AGC ACC TTT GCC AAC AAC 42 6 5 

Ala His Met Phe Glu Met Ser Ala Gly Glu Ser Thr Phe Ala Asn Asn 
1340 1345 1350 

AAG TCC TCA GTG CCC ACT GCC GGA CAT CAC CAC CAC AAC AAC CCC GGC 4313 
Lys Ser Ser Val Pro Thr Ala Gly His His His His Asn Asn Pro Gly 
1355 1360 1365 

GGC GGG TAC ATG CTC AGC AAG TCG CTC TAC CCT GAC CGG GTC ACG CAA 4361 
Gly Gly Tyr Met Leu Ser Lys Ser Leu Tyr Pro Asp Arg Val Thr Gin 
1370 1375 1380 

AAC CCT TTC ATC CCC ACT TTT GGG GAC GAC CAG TGC TTG CTC CAT GGC 4409 
Asn Pro Phe lie Pro Thr Phe Gly Asp Asp Gin Cys Leu Leu His Gly 
1385 1390 1395 1400 

AGC AAA TCC TAC TTC TTC AGG CAG CCC ACG GTG GCG GGG GCG TCG AAA 44 5 7 

Ser Lys Ser Tyr Phe Phe Arg Gin Pro Thr Val Ala Gly Ala Ser Lys 
1405 1410 1415 

GCC AGG CCG GAC TTC CGG GCC CTT GTC ACC AAC AAG CCG GTG GTC TCG 4505 
Ala Arg Pro Asp Phe Arg Ala Leu Val Thr Asn Lys Pro Val Val Ser 
1420 ~ 1425 1430 

GCC CTT CAT GGG GCC GTG CCA GCC CGT TTC CAG . AAG GAC ATC TGT ATA 455 3 

Ala Leu His Gly Ala Val Pro Ala Arg Phe Gin Lys Asp lie Cys lie 
1435 1440 1445 

GGG AAC CAG TCC AAC CCC TGT GTG CCT AAC AAC ACA AAC CCC AGG GCT 4 601 

Gly Asn Gin Ser Asn Pro Cys Val Pro Asn Asn Thr Asn Pro Arg Ala 
1450 1455 1460 

TTC AAT GGC TCC AGC AAT GGG CAT 'GTT TAT GAG AAA CTT TCT AGT ATT 4649 
Phe Asn Gly Ser Ser Asn Gly His Val Tyr Glu Lys Leu Ser Ser lie 
1465 " 1470 1475 1480 

GAG TCT GAT GTC TGAGTGAGGG AACAGAGAGG TTAAGGTGGG TACGGGAGGG 4701 
Glu Ser Asp Val 

148 
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TAAGGCTGTG GGTCGCGTGA TGCGCATGTC ACGGAGGGTG ACGGGGGTGA ACTTGGTTCC 4761 

CATTTGCTCC TTTCTTGTTT TAATTTATTT ATGGGATCCT GGAGTTCTGG TTCCTACTGG 4821 

GGGCAACCCT GGTGACCAGC ACCATCTCTC CTCCTTTTCA CAGTTCTCTC CTTCTTCCCC 4881 

CCGCTGTCAG CCATTCCTGT TCCCATGAGA TGATGCCATG GGCCCTCTCA GCAGGGGAGG 4941 

GTAGAGCGGA GAAAGGAAGG GCTGCATGCG GGCTTCCTCC TGGTGTGGAA GAGCTCCTTG 5001 

ATATCCTCTT TGAGTGAAGC TGGGAG AACC AAAAAG AGGC TATGTG AGCA CAAAGGTAGC 5061 

TTTTCCCAAA CTGATCTTTT CATTTAGGTG AGGAAGCAAA AGCATCTATG TGAGACCATT 5121 

TAGCACACTG CTTGTGAAAG GAAAGAGGCT CTGGCTAAAT TCATGCTGCT TAGATGACAT 5181 

CTGTCTAGGA ATCATGTGCC AAGCAGAGGT TGGG AGGCCA TTTGTGTTTA TATATAAGCC 5241 

CAAAAATGCT TGCTTCAACC CCATGAGACT CGATAGTGGT GGTGAACAGA ACCCAAGGTC 5 301 

ATTGGTGGCA GAGTGGATTC TTGAACAAAC TGGAAAGTAC GTTATGATAG TGTCCCCCGG 5361 

TGCCTTGGGG ACAAGAGCAG GTGGATTGTG CGTGCATGTG TGTTCATGCA CACTTGCACC 5421 

CATGTGTAGT CAGGTGCCTC AAGAGAAGGC AACCTTGACT CTTTCGTTGA ATTTGCATCT 5481 

CTTCAAGACA CAAGATTAAA ACAAAATTTA CGCTAAATTG GATTTTAAAT TATCTTC 5538 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1484 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Lys Pro Arg Ala Glu Cys Cys Ser Pro Lys Phe Trp Leu Val Leu 
15 10 15 

Ala Val Leu Ala Val Ser Gly Ser Arg Ala Arg Ser Gin Lys Ser Pro 
20 25 30 

Pro Ser lie Gly lie Ala Val lie Leu Val Gly Thr Ser Asp Glu Val 
35 " 40 45 

Ala lie Lys Asp Ala His Glu Lys Asp Asp Phe His His Leu Ser Val 
50 55 60 

Val Pro Arg Val Glu Leu Val Ala Met Asn Glu. Thr Asp Pro Lys Ser 
65 " 70 75 80 

lie lie Thr Arg lie Cys Asp Leu Met Ser Asp Arg Lys lie Gin Gly 
85 90 95 

Val Val Phe Ala Asp Asp Thr Asp Gin Glu Ala He Ala Gin He Leu 
100 105 110 

Asp Phe He Ser Ala Gin Thr Leu Thr Pro He Leu Gly He His Gly 
115 120 125 

Gly Ser Ser Met He Met Ala Asp Lys Asp Glu Ser Ser Met Phe Phe 
130 135 140 
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Gin Phe Gly Pro Ser lie Glu Gin Gin Ala Ser Val Met Leu Asn lie 

145 150 155 160 

Met Glu Glu Tyr Asp Trp Tyr lie Phe Ser lie Val Thr Thr Tyr Phe 

165 170 175 

Pro Gly Tyr Gin Asp Phe Val Asn Lys lie Arg Ser Thr lie Glu Asn 

180 185 190 

Ser Phe Val Gly Trp Glu Leu Glu Glu Val Leu Leu Leu Asp Met Ser 

195 200 205 

Leu Asp Asp Gly Asp Ser Lys lie Gin Asn Gin Leu Lys Lys Leu Gin 

210 215 220 

Ser Pro lie lie Leu Leu Tyr Cys Thr Lys Glu Glu Ala Thr Tyr lie 

225 230 235 240 

Phe Glu Val Ala Asn Ser Val Gly Leu Thr Gly Tyr Gly Tyr Thr Trp 



lie Val Pro Ser Leu" Val Ala Gly Asp Thr Asp Thr Val Pro Ala Glu 
260 265 * 270 

Phe Pro Thr Gly Leu lie Ser Val Ser Tyr Asp Glu Trp Asp Tyr Gly 
275 280 " 285 

Leu Pro Pro Arg Val Arg Asp Gly lie Ala lie lie Thr Thr Ala Ala 
290 295 300 

Ser Asp Met Leu Ser Glu His Ser Phe lie Pro Glu Pro Lys Ser Ser 
305 310 315 320 

Cys Tyr Asn Thr His Glu Lys Arg lie Tyr Gin Ser Asn Met Leu Asn 
325 330 335 

Arg Tyr Leu lie Asn Val Thr Phe Glu Gly Arg Asn Leu Ser Phe Ser 
340 345 350 

Glu Asp Gly Tyr Gin Met His Pro Lys Leu Val He He Leu Leu Asn 
355 360 365 

Lys Glu Arg Lys Trp Glu Arg Val Gly Lys Trp Lys Asp Lys Ser Leu 
370 375 380 

Gin Met Lys Tyr Tyr Val Trp Pro Arg Met Cys Pro Glu Thr Glu Glu 
3Q 5 390 395 400 

Gin Glu Asp Asp His Leu Ser He Val Thr Leu Glu Glu Ala Pro Phe 
405 410 415 

Val lie Val Glu Ser Val Asp Pro Leu Ser Gly Thr Cys Met Arg Asn 
420 425 _ . 430 

Thr Val Pro Cys Gin Lys Arg He Val Thr Glu Asn Lys Thr Asp Glu 
435 440 445 

Glu Pro Gly Tyr He Lys Lys Cys Cys Lys Gly Phe Cys He Asp He 
450 455 460 

Leu Lys Lys He Ser Lys Ser Val ,Lys Phe Thr Tyr Asp Leu Tyr Leu 
465 470 475 ' 480 

Val Thr Asn Gly Lys His Gly Lys Lys He Asn Gly Thr Trp Asn Gly 



245 



250 



255 



485 



490 



495 
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Met lie Gly Glu Val Val Met Lys Arg Ala Tyr Met Ala Val Gly Ser 
500 505 510 

Leu Thr lie Asn Glu Glu Arg Ser Glu Val Val Asp Phe Ser Val Pro 
515 520 525 

Phe lie Glu Thr Gly lie Ser Val Met Val Ser Arg Ser Asn Gly Thr 
530 535 540 

Val Ser Pro Ser Ala Phe Leu Glu Pro Phe Ser Ala Asp Val Trp Val 
545 550 555 560 

Met Met Phe Val Met Leu Leu lie Val Ser Ala Val Ala Val Phe Val 
565 570 575 

Phe Glu Tyr Phe Ser Pro Val Gly Tyr Asn Arg Cys Leu Ala Asp Gly 
580 585 590 

Arg Glu Pro Gly Gly Pro Ser Phe Thr lie Gly Lys Ala lie Trp Leu 
595 ^ 600 605 

Leu Trp Gly Leu VaT Phe Asn Asn Ser Val Pro Val Gin Asn Pro Lys 
610 615 620 

Gly Thr Thr Ser Lys lie Met Val Ser Val Trp Ala Phe Phe Ala Val 
625 630 635 640 

lie Phe Leu Ala Ser Tyr Thr Ala Asn Leu Ala Ala Phe Met lie Gin 
645 650 655 

Glu Glu Tyr Val Asp Gin Val Ser Gly Leu Ser Asp Lys Lys Phe Gin 
660 665 670 

Arg Pro Asn Asp Phe Ser Pro Pro Phe Arg Phe Gly Thr Val Pro Asn 
675 680 685 

Gly Ser Thr Glu Arg Asn lie Arg Asn Asn Tyr Ala Glu Met His Ala 
690 695 700 

Tyr Met Gly Lys Phe Asn Gin Arg Gly Val Asp Asp Ala Leu Leu Ser 
705 ~ 710 715 720 

Leu Lys Thr Gly Lys Leu Asp Ala Phe lie Tyr Asp Ala Ala Val Leu 
725 730 735 

Asn Tyr Met Ala Gly Arg Asp Glu Gly Cys Lys Leu Val Thr lie Gly 
740 745 750 

Ser Gly Lys Val Phe Ala Ser Thr Gly Tyr Gly He Ala He Gin Lys 
755 760 765 

Asp Ser Gly Trp Lys Arg Gin Val Asp Leu Ala He Leu Gin Leu Phe 
770 775 780 

Gly Asp Gly Glu Met Glu Glu Leu Glu Ala Leu Trp Leu Thr Gly He 
785 790 795 800 

Cys His Asn Glu Lys Asn Glu Val Met Ser Ser Gin Leu Asp He Asp 
805 810 815 

Asn Met Ala Gly Val Phe Tyr Met 'Leu Gly Ala Ala Met Ala Leu Ser 
820 " 825 830 

Leu He Thr Phe He Cys Glu His Leu Phe Tyr Trp Gin Phe Arg His 
835 840 845 
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Cys Phe Met Gly Val Cys Ser Gly Lys Pro Gly Met Val Phe Ser He 
850 855 860 

Ser Arq Gly He Tyr Ser Cys He His Gly Val Ala He Glu Glu Arg 
865 870 875 880 

Gin Ser Val Met Asn Ser Pro Thr Ala Thr Met Asn Asn Thr His Ser 

885 890 895 

Asn He Leu Arg Leu Leu Arg Thr Ala Lys Asn Met Ala Asn Leu Ser 
900 905 910 

Gly Val Asn Gly Ser Pro Gin Ser Ala Leu Asp Phe He Arg Arg Glu 
915 ~ 920 925 

Ser Ser Val Tyr Asp He Ser Glu His Arg Arg Ser Phe Thr His Ser 
930 " " 935 940 

Asp Cys Lys Ser Tyr Asn Asn Pro Pro Cys Glu Glu Asn Leu Phe Ser 
945 950 955 960 

Asp Tyr He Ser Glu-" Val Glu Arg Thr Phe Gly Asn Leu Gin Leu Lys 
965 " 970 975 

Asp Ser Asn Val Tyr Gin Asp His Tyr His His His His Arg Pro His 
980 985 990 

Ser He Gly Ser Ala Ser Ser He Asp Gly Leu Tyr Asp Cys Asp Asn 
995 1000 1005 

Pro Pro Phe Thr Thr Gin Ser Arg Ser He Ser Lys Lys Pro Leu Asp 
1010 1015 1020 

He Gly Leu Pro Ser Ser Lys His Ser Gin Leu Ser Asp Leu Tyr Gly 
1025 1030 1035 1040 

Lys Phe Ser Phe Lys Ser Asp Arg Tyr Ser Gly His Asp Asp Leu He 
1045 1050 1055 

Arg Ser Asp Val Ser Asp He Ser Thr His Thr Val Thr Tyr Gly Asn 
1060 1065 1070 

He Glu Gly Asn Ala Ala Lys Arg Arg Lys Gin Gin Tyr Lys Asp Ser 
1075 1080 1085 

Leu Lys Lys Arg Pro Ala Ser Ala Lys Ser Arg Arg Glu Phe Asp Glu 
1090 ~ 1095 1100 

He Glu Leu Ala Tyr Arg Arg Arg Pro Pro Arg Ser Pro Asp His Lys 
1105 1H0 1115 1120 

Arq Tyr Phe Arg Asp Lys Glu Gly Leu Arg Asp Phe Tyr Leu Asp Gin 
1125 ' H30 . 1135 

Phe Arg Thr Lys Glu Asn Ser Pro His Trp Glu His Val Asp Leu Thr 
1140 1145 1150 

Asp He Tyr Lys Glu Arg Ser Asp Asp Phe Lys Arg Asp Ser He Ser 
1155 1160 1165 

Gly Gly Gly Pro Cys Thr Asn Arg, Ser His He Lys His Gly Thr Gly 
1170 1175 1180 

Asp Lys His Gly Val Val Ser Gly Val Pro Ala Pro Trp Glu Lys Asn 
1185 ' 1190 H95 1200 



# 
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Leu Thr Asn Val Glu Trp Glu Asp Arg Ser Gly Gly Asn Phe Cys Arg 
1205 1210 1215 

Ser Cys Pro Ser Lys Leu His Asn Tyr Ser Thr Thr Val Thr Gly Gin 
1220 1225 1230 

Asn Ser Gly Arg Gin Ala Cys lie Arg Cys Glu Ala Cys Lys Lys Ala 
1235 1240 1245 

Gly Asn Leu Tyr Asp lie Ser Glu Asp Asn Ser Leu Gin Glu Leu Asp 
1250 ~ 1255 1260 

Gin Pro Ala Ala Pro Val Ala Val Thr Ser Asn Ala Ser Thr Thr Lys 
1265 1270 1275 1280 

Tyr Pro Gin Ser Pro Thr Asn Ser Lys Ala Gin Lys Lys Asn Arg Asn 
1285 1290 1295 

Lys Leu Arg Arg Gin His Ser Tyr Asp Thr Phe Val Asp Leu Gin Lys 
1300 1305 1310 

Glu Glu Ala Ala LeiTAla Pro Arg Ser Val Ser Leu Lys Asp Lys Gly 
1315 1320 1325 

Arg Phe Met Asp Gly Ser Pro Tyr Ala His Met Phe Glu Met Ser Ala 
1330 *" ~ 1335 1340 

Gly Glu Ser Thr Phe Ala Asn Asn Lys Ser Ser Val Pro Thr Ala Gly 
1345 1350 1355 1360 

His His His His Asn Asn Pro Gly Gly Gly Tyr Met Leu Ser Lys Ser 
1365 1370 1375 

Leu Tyr Pro Asp Arg Val Thr Gin Asn Pro Phe lie Pro Thr Phe Gly 
1380 1385 1390 

Asp Asp Gin Cys Leu Leu His Gly Ser Lys Ser Tyr Phe Phe Arg Gin 
1395 1400 1405 

Pro Thr Val Ala Gly Ala Ser Lys Ala Arg Pro Asp Phe Arg Ala Leu 
1410 1415 1420 

Val Thr Asn Lys Pro Val Val Ser Ala Leu His Gly Ala Val Pro Ala 
1425 1430 1435 1440 

Arg Phe Gin Lys Asp lie Cys He Gly Asn Gin Ser Asn Pro Cys Val 
1445 ~ 1450 1455 

Pro Asn Asn Thr Asn Pro Arg Ala Phe Asn Gly Ser Ser Asn Gly His 
1460 1465 1470 

Val Tyr Glu Lys Leu Ser Ser He Glu Ser Asp Val 
1475 1480 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

<ii) MOLECULE TYPE: cDNA 
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(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 485.. 4495 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CGAGAACACA GCGAGTGTGT GAGTCCCTCC CGCTCCAGCT CCTCCAAGCC GCGGCCGCCG 60 

CCGCCACCCT CGCCCGCAGC CTCCCGCAGC CTCCCTCGGC CACCGGTGTC TGGTGGGGGT 120 

GTTGCCTGGG TAGGTCGGCC CGGCCCCCAG GGGTCTCTCG AGCGTCTGCC ATCTGCCCGA 180 

GAAACATGTG TGGCCACGTC CTCGCCTAGT CCAGGTGGCC GCAACCTTGG GGGAGAGACA 2 40 

GGGCAGGACA GGACCAAGGT AAGAGGTAAG GAGGAGACGG CGCCAGGGAC AGACAGGAGG 300 

TCCCGGCTTG CCGTTGTGCG CACCACCACT GCCGCCGCCC CGGGGCCTGC CCCCGACATC 360 

GGCTCTCTGA GCCCTCCTCG GAATCTTGGG GTCGCTGGAC GCCGGG TTCC GGTCCTGGCC 420 

CCCCCGCCAT CCCCCCAACA GAACAGGGTC ATGAAAAGAG GCCGCCCGGC GGGGCCCGCA 480 

GGCG ATG CGC GGC GCC GGT GGC CCC CGC GGC CCT CGG GGC CCC GCT AAG 529 
Met Arg Gly Ala Gly Gly Pro Arg Gly Pro Arg Gly Pro Ala Lys 
1 ' 5 10 15 

ATG CTG CTG CTG CTG GCG CTG GCC TGC GCC AGC CCG TTC CCG GAG GAG 57 7 

Met Leu Leu Leu Leu Ala Leu Ala Cys Ala Ser Pro Phe Pro Glu Glu 
20 A 2 5 30 

GCG CCG GGG CCG GGC GGG GCC GGT GGG CCC GGC GGC GGC CTC GGC GGG 62 5 

Ala Pro Gly Pro Gly Gly Ala Gly Gly Pro Gly Gly Gly Leu Gly Gly 
35 40 45 

GCG CGG CCG CTC AAC GTG GCG CTC GTG TTC TCG GGG CCC GCG TAC GCG 673 
Ala Arg Pro Leu Asn Val Ala Leu Val Phe Ser Gly Pro Ala Tyr Ala 
50 55 60 

GCC GAG GCG GCA CGC CTG GGC CCG GCC GTG GCG GCG GCG GTG CGC AGC 721 
Ala Glu Ala Ala Arg Leu Gly Pro Ala Val Ala Ala Ala Val Arg Ser 
65 ~ 70 75 

CCG GGC CTA GAC GTG CGG CCC GTG GCG CTG GTG CTC AAC GGC TCG GAC 7 69 

Pro Gly Leu Asp Val Arg Pro Val Ala Leu Val Leu Asn Gly Ser Asp 
80 ~ 85 90 " 95 

CCG CGC AGC CTC GTG CTG CAG CTC TGC GAC CTG CTG TCG GGG TTG CGC 817 
Pro Arg Ser Leu Val Leu Gin Leu Cys Asp Leu Leu Ser Gly Leu Arg 
100 105 * 110 

GTG CAC GGC GTG GTC TTC GAA GAC GAC TCG CGC GCG CCC GCC GTC GCG 865 
Val His Gly Val Val Phe Glu Asp Asp Ser Arg Ala Pro Ala Val Ala 
115 120 125 

CCC ATC CTC GAC TTC CTG TCG GCG CAG ACC TCG CTC CCC ATC GTG TCC 913 
Pro lie Leu Asp Phe Leu Ser Ala Gin Thr Ser Leu Pro lie Val Ser 
130 135 140 

GAG CAC GGC GGC GCC GCG CTC GTG CTC ACG CCC AAG GAG AAG GGC TCC 961 
Glu His Gly Gly Ala Ala Leu Val, Leu Thr Pro Lys Glu Lys Gly Ser 
145 ~ " 150 155 

ACC TTC CTC CAC CTG GGC TCT TCC CCC GAG CAA CAG CTT CAG GTC ATC 1009 
Thr Phe Leu His Leu Gly Ser Ser Pro Glu Gin Gin Leu Gin Val lie 
160 165 170 175 
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TTT GAG GTG CTG GAG GAG TAT GAC TGG ACG TCC TTT GTA GCC GTG ACC 105 7 

Phe Glu Val Leu Glu Glu Tyr Asp Trp Thr Ser Phe Val Ala Val Thr 

180 185 190 

ACT CGT GCC CCT GGC CAC CGG GCC TTC CTG TCC TAC ATT GAG GTG CTG 1105 
Thr Arg Ala Pro Gly His Arg Ala Phe Leu Ser Tyr He Glu Val Leu 
195 200 205 

ACT GAC GGC AGT CTG GTG GGC TGG GAG CAC CGC GGA GCG CTG ACG CTG 1153 
Thr Asp Gly Ser Leu Val Gly Trp Glu His Arg Gly Ala Leu Thr Leu 
210 215 220 

GAC CCT GGG GCG GGC GAG GCC GTG CTC AGT GCC CAG CTC CGC AGT GTC 1201 
Asp Pro Gly Ala Gly Glu Ala Val Leu Ser Ala Gin Leu Arq Ser Val 
225 230 235 

AGC GCG CAG ATC CGC CTG CTC TTC TGC GCC CGA GAG GAG GCC GAG CCC 1249 
Ser Ala Gin He Arg Leu Leu Phe Cys Ala Arg Glu Glu Ala Glu Pro 
240 245 250 255 

GTG TTC CGC GCA GCT GAG GAG GCT GGC CTC ACT GGA TCT GGC TAC GTC 1297 
Val Phe Arg Ala_Ala Glu Glu Ala Gly Leu Thr Gly Ser Gly Tyr 'Val 
260 265 ' 270 

TGG TTC ATG GTG GGG CCC CAG CTG GCT GGA GGC GGG GGC TCT GGG GCC 1345 
Trp Phe Met Val Gly Pro Gin Leu Ala Gly Gly Gly Gly Ser Gly Ala 
275 280 * ' 285 

CCT GGT GAG CCC CCT CTT CTG CCA GGA GGC GCC CCC CTG CCT GCC GGG 1393 
Pro Gly Glu Pro Pro Leu Leu Pro Gly Gly Ala Pro Leu Pro Ala Gly 
290 295 300 

CTG TTT GCA GTG CGC TCG GCT GGC TGG CGG GAT GAC CTG GCT CGG CGA 1441 
Leu Phe Ala Val Arg Ser Ala Gly Trp Arg Asp Asp Leu Ala Arq Arq 
305 310 315 

GTG GCA GCT GGC GTG GCC GTA GTG GCC AGA GGT GCC CAG GCC CTG CTG 1489 
Val Ala Ala Gly Val Ala Val Val Ala Arg Gly Ala Gin Ala Leu Leu 
3 20 325 330 335 

CGT GAT TAT GGT TTC CTT CCT GAG CTC GGC CAC GAC TGT CGC GCC CAG 1537 
Arg Asp Tyr Gly Phe Leu Pro Glu Leu Gly His Asp Cys Arg Ala Gin 
340 345 L ^ 350 

AAC CGC ACC CAC CGC GGG GAG AGT CTG CAT AGG TAC TTC ATG AAC ATC 1585 
Asn Arg Thr His Arg Gly Glu Ser Leu His Arg Tyr Phe Met Asn He 
355 360 365 

ACG TGG GAT AAC CGG GAT TAC TCC TTC AAT GAG GAC GGC TTC CTA GTG 1633 
Thr Trp Asp Asn Arg Asp Tyr Ser Phe Asn Glu Asp Gly Phe Leu Val 
370 375 380 

AAC CCC TCC CTG GTG GTC ATC TCC CTC ACC AGA GAC AGG ACG TGG GAG 1681 
Asn Pro Ser Leu Val Val He Ser Leu Thr Arg Asp Arg Thr Trp Glu 
385 390 395 * 

GTG GTG GGC AGC TGG GAG CAG CAG ACG CTC CGC CTC AAG TAC CCG CTG 172 9 

Val Val Gly Ser Trp Glu Gin Gin Thr Leu Arg Leu Lys Tyr Pro Leu 
400 4 05 410 415 

TGG TCC CGC TAT GGT CGC TTC CTG CAG CCA GTG GAC GAC ACG CAG CAC 17 7 7 

Trp Ser Arg Tyr Gly Arg Phe Levi Gin Pro Val Asp Asp Thr Gin His 
4 20 425 " 430 

CTC GCG GTG GCC ACG CTG GAG GAA AGG CCG TTT GTC ATC GTG GAG CCT 1825 
Leu Ala Val Ala Thr Leu Glu Glu Arg Pro Phe Val He Val Glu Pro 
4 35 440 445 



# 
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GCA GAC CCT ATC AGC GGC ACC TGC ATC CGA GAC TCC GTC CCC TGC CGG 1873 
Ala Asp Pro lie Ser Gly Thr Cys He Arg Asp Ser Val Pro Cys Arg 
^ 1 - - 455 4 60 



450 



AGC CAG CTC AAC CGA ACC CAC AGC CCT CCA CCG GAT GCC CCC CGC CCG 1921 
Ser Gin Leu Asn Arg Thr His Ser Pro Pro Pro Asp Ala Pro Arg Pro 

470 475 



465 



GAA AAG CGC TGC TGC AAG GGT TTC TGC ATC GAC ATT CTG AAG CGG CTG 19 69 
Glu Lys Arg Cys Cys Lys Gly Phe Cys He Asp He Leu Lys Arg Leu 
J 485 490 495 
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GCG CAT ACC ATC GGC TTC AGC TAC GAC CTC TAC CTG GTC ACC AAT GGC 2017 
Ala His Thr He Gly Phe Ser Tyr Asp Leu Tyr Leu Val Thr Asn Gly 
500 505 510 

AAG CAC GGA AAG AAG ATC GAT GGC GTC TGG AAC GGC ATG ATC GGG GAG 2065 
Lys His Gly Lys Lys He Asp Gly Val Trp Asn Gly Met He Gly Glu 
515 520 525 

GTG TTC TAC CAG CGC .GCA GAC ATG GCC ATC GGC TCC CTC ACC ATC AAC 2113 
Val Phe Tyr Gin Arg^Ala Asp Met Ala He Gly Ser Leu Thr He Asn 
' 535 540 



530 



GAG GAG CGC TCC GAG ATC GTG GAC TTC TCC GTC CCC TTC GTG GAG ACC 2161 
Glu Glu Arg Ser Glu He Val Asp Phe Ser Val Pro Phe Val Glu Thr 

550 555 



545 



GGC ATC AGC GTC ATG GTG GCG CGC AGC AAT GGC ACG GTG TCC CCC TCG 2209 
Glv He Ser Val Met Val Ala Arg Ser Asn Gly Thr Val Ser Pro Ser 
560 565 570 575 

GCC TTC CTC GAG CCC TAC AGC CCC GCC GTG TGG GTG ATG ATG TTC GTC 2257 
Ala Phe Leu Glu Pro Tyr Ser Pro Ala Val Trp Val Met Met Phe Val 
580 585 590 

ATG TGC CTC ACT GTG GTC GCC GTC ACT GTT TTC ATC TTC GAG TAC CTC 2305 
Met Cys Leu Thr Val Val Ala Val Thr Val Phe He Phe Glu Tyr Leu 
595 . 600 605 

AGT CCT GTT GGT TAC AAC CGC AGC CTG GCC ACG GGC AAG CGC CCT GGC 2 3 53 

Ser Pro Val Gly Tyr Asn Arg Ser Leu Ala Thr Gly Lys Arg Pro Gly 
n 6 2 O 



GGT TCA ACC TTC ACC ATT GGG AAA TCC ATC TGG CTG CTC TGG GCC CTG 2401 
Gly Ser Thr Phe Thr He Gly Lys Ser He Trp Leu Leu Trp Ala Leu 
625 630 635 

GTG TTC AAT AAT TCG GTG CCC GTG GAG AAC CCC CGG GGA ACC ACC AGC 2449 
Val Phe Asn Asn Ser Val Pro Val Glu Asn Pro Arg Gly Thr Thr Ser 
640 645 650 655 

AAA ATC ATG GTG CTG GTG TGG GCC TTC TTC GCC GTC ATC TTC CTC GCC 2497 
Lys He Met Val Leu Val Trp Ala Phe Phe Ala Val He Phe Leu Ala 
660 665 670 

AGC TAC ACA GCC AAC CTG GCC GCC TTC ATG ATC CAG GAG GAG TAC GTG 2 545 

Ser Tyr Thr Ala Asn Leu Ala Ala Phe Met He Gin Glu Glu Tyr Val 
675 680 685 

GAT ACT GTG TCT GGG CTC AGT GAC 'CGC AAG TTC CAG AGG CCC CAG GAG 2593 
Asp Thr Val Ser Gly Leu Ser Asp Arg Lys Phe Gin Arg Pro Gin Glu 
690 695 700 

CAG TAC CCG CCC CTG AAG TTT GGG ACC GTG CCC AAC GGC TCC ACG GAG 2 641 

Gin Tyr Pro Pro Leu Lys Phe Gly Thr Val Pro Asn Gly Ser Thr Glu 

710 ' 715 



705 
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AAG AAC ATC CGC AGC AAC TAT CCC GAC ATG CAC AGC TAC ATG GTG CGC 2 689 

Lys Asn lie Arg Ser Asn Tyr Pro Asp Met His Ser Tyr Met Val Arg 

720 725 730 ~ 735 

TAC AAC CAG CCC CGC GTA GAG GAA GCG CTC ACT CAG CTC AAG GCA GGG 2 73 7 

Tyr Asn Gin Pro Arg Val Glu Glu Ala Leu Thr Gin Leu Lys Ala Gly 
740 745 750 

AAG CTG GAC GCC TTC ATC TAC GAT GCT GCA GTG CTC AAT TAC ATG GCC 2785 
Lys Leu Asp Ala Phe lie Tyr Asp Ala Ala Val Leu Asn Tyr Met Ala 
755 ~ 760 765 

CGC AAG GAC GAG GGC TGC AAG CTT GTC ACC ATC GGC TCC GGC AAG GTC 2833 
Arg Lys Asp Glu Gly Cys Lys Leu Val Thr lie Gly Ser Gly Lys Val 
770 '* 775 ' 780 

TTC GCC ACG ACA GGC TAT GGC ATC GCC CTG CAC AAG GGC TCC CGC TGG 2881 
Phe Ala Thr Thr Gly Tyr Gly lie Ala Leu His Lys Gly Ser Arg Trp 
785 790 795 

AAG CGG CCC ATC GAC. CTG GCG TTG CTG CAG TTC CTG GGG GAT GAT GAG 2929 
Lys Arg Pro lie Asp" Leu Ala Leu Leu Gin Phe Leu Gly Asp Asp Glu 
800 * 805 810 ~ 815 

ATC GAG ATG CTG GAG CGG CTG TGG CTC TCT GGG ATC TGC CAC AAT GAC 2977 
lie Glu Met Leu Glu Arg Leu Trp Leu Ser Gly lie Cys His Asn Asp 
820 825 " ' 830 

AAA ATC GAG GTG ATG AGC AGC AAG CTG GAC ATC GAC AAC ATG GCG GGC 302 5 

Lys lie Glu Val Met Ser Ser Lys Leu Asp lie Asp Asn Met Ala Gly 
835 840 845 

GTC TTC TAC ATG CTC CTG GTG GCC ATG GGC CTG TCC CTG CTG GTC TTC 3073 
Val Phe Tyr Met Leu Leu Val Ala Met Gly Leu Ser Leu Leu Val Phe 
850 855 ' 860 

GCC TGG GAG CAC CTG GTG TAC TGG CGC CTG CGG CAC TGC CTG GGG CCC 3121 
Ala Trp Glu His Leu Val Tyr Trp Arg Leu Arg His Cys Leu Gly Pro 
865 . 870 ~ 875 

ACC CAC CGC ATG GAC TTC CTG CTG GCC TTC TCC AGG GGC ATG TAC AGC 3169 
Thr His Arg Met Asp Phe Leu Leu Ala Phe Ser Arg Gly Met Tyr Ser 
880 885 890 " ~ 895 

TGC TGC AGC GCT GAG GCC GCC CCA CCG CCC GCC AAG CCC CCG CCG CCG 3217 
Cys Cys Ser Ala Glu Ala Ala Pro Pro Pro Ala Lys Pro Pro Pro Pro 
900 905 * 910 

CCA CAG CCC CTG CCC AGC CCC GCG TAC CCC GCG CCG GGG CCG GCT CCC 32 65 

Pro Gin Pro Leu Pro Ser Pro Ala Tyr Pro Ala Pro Gly Pro Ala Pro 
915 920 925 

GGG CCC GCA CCT TTC GTC CCC CGC GAG CGC GCC TCA GTG GCC CGC TGG 3313 
Gly Pro Ala Pro Phe Val Pro Arg Glu Arg Ala Ser Val Ala Arg Trp 
930 935 " 940 

CGC CGG CCC AAG GGC GCG GGG CCG CCG GGG GGC GCG GGC CTG GCC GAC 3361 
Arg Arg Pro Lys Gly Ala Gly Pro Pro Gly Gly Ala Gly Leu Ala Asp 
945 950 955 

GGC TTC CAC CGC TAC TAC GGC CCC ATC GAG CCG CAG GGC CTA GGC CTC 3409 
Gly Phe His Arg Tyr Tyr Gly Pro' lie Glu Pro Gin Gly Leu Gly Leu 
960 965 970 975 

GGC CTG GGC GAA GCG CGC GCG GCA CCG CGG GGC GCA GCC GGG CGC CCG 3457 
Gly Leu Gly Glu Ala Arg Ala Ala Pro Arg Gly Ala Ala Gly Arg Pro 
980 985 990 
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TCC CCG CCG GCC CCT CAG CCC CCG CAG AAG CCG CCG GCC TCC TAT 
Leu Ser Pro Pro Ala Ala Gin Pro Pro Gin Lys Pro Pro Ala Ser Tyr 
995 1000 1005 y 

TTC GCC ATC GTA CGC GAC AAG GAG CCA GCC GAG CCC CCC GCC GGC GCC 
Phe Ala lie Val Arg Asp Lys Glu Pro Ala Glu Pro Pro Ala Gly Ala 

1015 102 0 

TTC CCC GGC TTC CCG TCC CCG CCC GCG CCC CCC GCC GCC GCG GCC ACC 
Phe Pro Gly Phe Pro Ser Pro Pro Ala Pro Pro Ala Ala Ala Ala Sr 
3 1030 1035 

?r T< T 5?° CCG CCA CTC TGC CGC TTG GCC TT C GAG GAC GAG AGC CCG 
Ala Val Gly Pro Pro Leu Cys Arg Leu Ala Phe Glu Asp Glu Ser Pro 

1045 1050 1055 

CCG GCG CCC GCG CGG TGG CCG CGC TCG GAC CCC GAG AGC CAA CCC CTG 
Pro Ala Pro Ala Arg Trp Pro Arg Ser Asp Pro Glu Ser Gin Pro Leu 
1060 2065 1070 

CTG GGG CCA GGC GCG GGC GGC GCG GGG GGC ACG GGG GGC GCA GGC GGA 
Leu Gly Pro Gly Ala Gly Gly Ala Gly Gly Thr Gly Gly Ala Gly Gly 
1075 1080 io85 

Glv a t'a CCG TGC TTC GCC GCG CCG CCC CCG TGC 

Gly Ala Pro Ala Ala Pro Pro Pro Cys Phe Ala Ala Pro Pro Pro Cys 
1090 logs iioo 

III l A ° f T ° ° AT GTC GAC CAG TCG CCG TCG OAC TCG GAG GAC TCG GAG 
Phe Ty^Leu Asp Val Asp Ginger Pro Ser Asp Ser^Glu Asp Ser Glu 

AGC CTG GCC GGC GCG TCC CTG GCC GGC CTG GAT CCC TGG TGG TTC GCC 
Ser Leu Ala Gly Ala Ser Leu Ala Gly Leu Asp Pro Trp Jrp J£ Ala 
X± ^ U 1125 1130 1135 

GAC TTC CCT TAC CCG TAT GCC GAT CGC CTC GGG CSG CCC GCG GCA CGC 
Asp Phe Pro Tyr Pro Tyr Ala Asp Arg Leu Gly Xaa Pro Ala Ala Arg 
H40 H45 11SQ * 

TAG GGA TTG GTC GAC AAA CTA GGG GGC TGG CTC GCC GGG AGC TGG GAC 
Tyr Gly Leu Val Asp Lys Leu Gly Gly Trp Leu Ala Gly Ser rrp Asp 
H55 nee 1165 p p 

TAC CTG CCT CCS CGC AGC GGT CGG GCC GCC TGG CAC TGT CGG CAC TGC 
Tyr Leu ProXaa Arg Ser Gly ArgAla Ala Trp His Cy^Arg SS Cys 

A C a ser Leu r?° o° G ° CG CCG CGC CAT CTC AGC TGC ^CG CAC 

Ala Ser Leu Glu Leu Leu Pro Pro Pro Arg His Leu Ser Cys Ser His 

1190 1195 

A™ GGC GAC GGC GGC TGG TGG GCG CCA CCG CCT CCA CCC TGG GCC 

Asp Gly Leu Asp Gly Gly Trp Trp Ala Pro Pro Pro Pro Pro Trp Ala 

I 205 1210 1215 

Ala G G v Pro T CTG CGC CGG GCC CGC TGC GGG TGC CCG CGG TCG 

Ala Gly Pro Leu Pro Arg Arg Arg Ala Arg Cys Gly Cys Pro Arg Ser 

1220 1225 1230 

SJs Pro St 0 ° GC p CG CGG GCC TCG CAC CGC ACG CCC GCC GC ^ GCC GCG 
Ha.s Pro Hxs Arg Pro Arg Ala Ser 'His Arg Thr Pro Ala Ala Ala Ala 

1235 1240 1245 

Pro HLs Hf C f G CGG CGC GCC GCT GGG GGC TGG GAC CTC CCG 

Pro HiS Hxs Hxs Arg Hxs Arg Arg Ala Ala Gly Gly Trp Asp Leu Pro 

1255 1260 
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3841 



3889 



3937 



3985 



4033 



4081 



4129 
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4225 



4273 
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CCG CCC GCG CCC ACC TCG CGC TCG CTC GAG GAC CTC AGC TCG TGC CCT 4321 

Pro Pro Ala Pro Thr Ser Arg Ser Leu Glu Asp Leu Ser Ser Cys Pro 

1265 1270 1275 

CGC GCC GCC CCT GCG CGC AGG CTT ACC GGG CCC TCC CGC CAC GCT CGC 4369 
Arg Ala Ala Pro Ala Arg Arg Leu Thr Gly Pro Ser Arg His Ala Arg 
1280 1285 1290 1295 

AGG TGT CCG CAC GCC GCG CAC TGG GGG CCG CCG CTG CCT ACA GCT TCC 4417 
Arg Cys Pro His Ala Ala His Trp Gly Pro Pro Leu Pro Thr Ala Ser 
1300 1305 1310 

CAC CGG AGA CAC CGG GGC GGG GAC CTG GGC ACC CGC AGG GGC TCG GCG 4465 
His Arg Arg His Arg Gly Gly Asp Leu Gly Thr Arg Arg Gly Ser Ala 
1315 1320 1325 

CAC TTC TCT AGC CTC GAG TCC GAG GTA TGACGCGGCC CCGGGGGCCC 4512 
His Phe Ser Ser Leu Glu Ser Glu Val 
1330 1335 

CACCGCCCCC TTGGTCAGCG CAGGCCACGG CCCGAGGGGG CGCCCGCAGT GGACAGGACC 4572 

CGCGTGGGTT GGG AAGG AAA GCAGTGGAAC TGGCCGGACC CCGCCTGGAG CAGCGTCCTG 4632 

CGCCCCCTGG TTCTGGAGGA ACCGCAAGCC GG AG AGG ATT TGGTCCCTCA ACTATCACCC 4692 

AGG 4 69 5 

(2) INFORMATION FOR SEQ ID NO:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1336 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Arg Gly Ala Gly Gly Pro Arg Gly Pro Arg Gly Pro Ala Lys Met 
1 5 10 15 

Leu Leu Leu Leu Ala Leu Ala Cys Ala Ser Pro Phe Pro Glu Glu Ala 
20 25 30 

Pro Gly Pro Gly Gly Ala Gly Gly Pro Gly Gly Gly Leu Gly Gly Ala 
35 40 45 

Arg Pro Leu Asn Val Ala Leu Val Phe Ser Gly Pro Ala Tyr Ala Ala 
50 55 60 

Glu Ala Ala Arg Leu Gly Pro Ala Val Ala Ala -Ala Val Arg Ser Pro 
65 " 70 75 80 

Gly Leu Asp Val Arg Pro Val Ala Leu Val Leu Asn Gly Ser Asp Pro 
85 90 95 

Arg Ser Leu Val Leu Gin Leu Cys Asp Leu Leu Ser Gly Leu Arg Val 
100 105 110 

His Gly Val Val Phe Glu Asp Asp Ser Arg Ala Pro Ala Val Ala Pro 
115 120 125 

lie Leu Asp Phe Leu Ser Ala Gin Thr Ser Leu Pro lie Val Ser Glu 
130 * 135 140 



# 
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His Gly Gly Ala Ala Leu Val Leu Thr Pro Lys Glu Lys Gly Ser Thr 
145 ' 1 150 155 160 

Phe Leu His Leu Gly Ser Ser Pro Glu Gin Gin Leu Gin Val lie Phe 
165 170 175 

Glu Val Leu Glu Glu Tyr Asp Trp Thr Ser Phe Val Ala Val Thr Thr 
180 ' 185 190 

Arg Ala Pro Gly His Arg Ala Phe Leu Ser Tyr lie Glu Val Leu Thr 
195 200 205 

Asp Gly Ser Leu Val Gly Trp Glu His Arg Gly Ala Leu Thr Leu Asp 
210 215 220 

Pro Gly Ala Gly Glu Ala Val Leu Ser Ala Gin Leu Arg Ser Val Ser 
225 J 230 235 240 

Ala Gin He Arg Leu Leu Phe Cys Ala Arg Glu Glu Ala Glu Pro Val 
245 250 255 

Phe Arg Ala Ala Glu-Glu Ala Gly Leu Thr Gly Ser Gly Tyr Val Trp 
260 265 270 

Phe Met Val Gly Pro Gin Leu Ala Gly Gly Gly Gly Ser Gly Ala Pro 
275 " 280 285 

Gly Glu Pro Pro Leu Leu Pro Gly Gly Ala Pro Leu Pro Ala Gly Leu 
290 295 300 

Phe Ala Val Arg Ser Ala Gly Trp Arg Asp Asp Leu Ala Arg Arg Val 
305 310 315 320 

Ala Ala Gly Val Ala Val Val Ala Arg Gly Ala Gin Ala Leu Leu Arg 
325 330 335 

Asp Tyr Gly Phe Leu Pro Glu Leu Gly His Asp Cys Arg Ala Gin Asn 
340 345 350 

Arg Thr His Arg Gly Glu Ser Leu His Arg Tyr Phe Met Asn He Thr 
355 ~ 360 365 

Trp Asp Asn Arg Asp Tyr Ser Phe Asn Glu Asp Gly Phe Leu Val Asn 
370 " 375 380 

Pro Ser Leu Val Val He Ser Leu Thr Arg Asp Arg Thr Trp Glu Val 
385 390 395 400 

Val Gly Ser Trp Glu Gin Gin Thr Leu Arg Leu Lys Tyr Pro Leu Trp 
405 410 415 

Ser Arg Tyr Gly Arg Phe Leu Gin Pro Val Asp Asp Thr Gin His Leu 
420 425 _ . 430 

Ala Val Ala Thr Leu Glu Glu Arg Pro Phe Val He Val Glu Pro Ala 
435 440 445 

Asp Pro He Ser Gly Thr Cys He Arg Asp Ser Val Pro Cys Arg Ser 
450 455 460 

Gin Leu Asn Arg Thr His Ser Pro Pro Pro Asp Ala Pro Arg Pro Glu 
465 470 475 480 

Lys Arg Cys Cys Lys Gly Phe Cys He Asp He Leu Lys Arg Leu Ala 
485 490 495 



m 
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His Thr lie Gly Phe Ser Tyr Asp Leu Tyr Leu Val Thr Asn Gly Lys 
500 505 * 510 

His Gly Lys Lys lie Asp Gly Val Trp Asn Gly Met lie Gly Glu Val 
515 520 ^ 525 

Phe Tyr Gin Arg Ala Asp Met Ala lie Gly Ser Leu Thr lie Asn Glu 
530 535 540 

Glu Arg Ser Glu lie Val Asp Phe Ser Val Pro Phe Val Glu Thr Gly 
545 550 555 560 

lie Ser Val Met Val Ala Arg Ser Asn Gly Thr Val Ser Pro Ser Ala 
565 570 575 

Phe Leu Glu Pro Tyr Ser Pro Ala Val Trp Val Met Met Phe Val Met 
580 585 590 

Cys Leu Thr Val Val Ala Val Thr Val Phe lie Phe Glu Tyr Leu Ser 
595 600 605 

Pro Val Gly Tyr Asn" Arg Ser Leu Ala Thr Gly Lys Arg Pro Gly Gly 
610 615 620 

Ser Thr Phe Thr lie Gly Lys Ser lie Trp Leu Leu Trp Ala Leu Val 
625 630 635 640 

Phe Asn Asn Ser Val Pro Val Glu Asn Pro Arg Gly Thr Thr Ser Lys 
645 650 655 

lie Met Val Leu Val Trp Ala Phe Phe Ala Val lie Phe Leu Ala Ser 
660 665 670 

Tyr Thr Ala Asn Leu Ala Ala Phe Met lie Gin Glu Glu Tyr Val Asp 
675 680 685 

Thr Val Ser Gly Leu Ser Asp Arg Lys Phe Gin Arg Pro Gin Glu Gin 
690 695 700 

Tyr Pro Pro Leu Lys Phe Gly Thr Val Pro Asn Gly Ser Thr Glu Lys 
705 710 715 ~ 720 

Asn lie Arg Ser Asn Tyr Pro Asp Met His Ser Tyr Met Val Arg Tyr 
725 730 ^ 735 

Asn Gin Pro Arg Val Glu Glu Ala Leu Thr Gin Leu Lys Ala Gly Lys 
740 745 750 

Leu Asp Ala Phe lie Tyr Asp Ala Ala Val Leu Asn Tyr Met Ala Arg 
755 760 765 

Lys Asp Glu Gly Cys Lys Leu Val Thr lie Gly Ser Gly Lys Val Phe 
770 775 780 

Ala Thr Thr Gly Tyr Gly lie Ala Leu His Lys Gly Ser Arg Trp Lys 
785 790 795 " ~ 800 

Arg Pro lie Asp Leu Ala Leu Leu Gin Phe Leu Gly Asp Asp Glu lie 
805 810 ' * 815 

Glu Met Leu Glu Arg Leu Trp Leu Ser Gly lie Cys His Asn Asp Lys 
820 '825 * 830 

He Glu Val Met Ser Ser Lys Leu Asp lie Asp Asn Met Ala Gly Val 
835 840 845 



138 



Phe Tyr Met Leu Leu Val Ala Met Gly Leu Ser Leu Leu Val Phe Ala 
850 855 860 

Trp Glu His Leu Val Tyr Trp Arg Leu Arg His Cys Leu Gly Pro Thr 
865 870 875 ' 880 

His Arg Met Asp Phe Leu Leu Ala Phe Ser Arg Gly Met Tyr Ser Cys 
885 890 " " 895 

Cys Ser Ala Glu Ala Ala Pro Pro Pro Ala Lys Pro Pro Pro Pro Pro 
900 905 * 910 

Gin Pro Leu Pro Ser Pro Ala Tyr Pro Ala Pro Gly Pro Ala Pro Gly 
915 920 925 

Pro Ala Pro Phe Val Pro Arg Glu Arg Ala Ser Val Ala Arg Trp Arq 
930 935 940 

Arg Pro Lys Gly Ala Gly Pro Pro Gly Gly Ala Gly Leu Ala Asp Gly 
94 5 950 955 96 0 

Phe His Arg Tyr Tyr-"Gly Pro He Glu Pro Gin Gly Leu Gly Leu Gly 
965 970 " 975 

Leu Gly Glu Ala Arg Ala Ala Pro Arg Gly Ala Ala Gly Arg Pro Leu 
980 985 990 

Ser Pro Pro Ala Ala Gin Pro Pro Gin Lys Pro Pro Ala Ser Tyr Phe 
995 1000 ' 1005 

Ala He Val Arg Asp Lys Glu Pro Ala Glu Pro Pro Ala Gly Ala Phe 
1010 1015 1020 

Pro Gly Phe Pro Ser Pro Pro Ala Pro Pro Ala Ala Ala Ala Thr Ala 
1025 1030 1035 1040 

Val Gly Pro Pro Leu Cys Arg Leu Ala Phe Glu Asp Glu Ser Pro Pro 
1045 1050 1055 

Ala Pro Ala Arg Trp Pro Arg Ser Asp Pro Glu Ser Gin Pro Leu Leu 
1060 1065 1070 

Gly Pro Gly Ala Gly Gly Ala Gly Gly Thr Gly Gly Ala Gly Gly Gly 
1075 1080 " 1085 

Ala Pro Ala Ala Pro Pro Pro Cys Phe Ala Ala Pro Pro Pro Cys Phe 
1090 1095 iioo 

Tyr Leu Asp Val Asp Gin Ser Pro Ser Asp Ser Glu Asp Ser Glu Ser 
H05 mo ins 112Q 

Leu Ala Gly Ala Ser Leu Ala Gly Leu Asp Pro Trp Trp Phe Ala Asp 
1125 H30 ~ H35 

Phe Pro Tyr Pro Tyr Ala Asp Arg Leu Gly Xaa Pro Ala Ala Arg Tyr 
1140 H45 H50 

Gly Leu Val Asp Lys Leu Gly Gly Trp Leu Ala Gly Ser Trp Asp Tyr 

H60 H65 

Leu Pro Xaa Arg Ser Gly Arg Ala Ala Trp His Cys Arg His Cys Ala 
H70 H75 < 1180 

??L LeU G1U Leu Leu Pro Pro Pro Ar 9 His Leu Ser c y s Ser His Asp 
1185 H90 1195 1200 
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Gly Leu Asp Gly Gly Trp Trp Ala Pro Pro Pro Pro Pro Trp Ala Ala 
1205 1210 1215 

Gly Pro Leu Pro Arg Arg Arg Ala Arg Cys Gly Cys Pro Arg Ser His 
1220 1225 " " 1230 

Pro His Arg Pro Arg Ala Ser His Arg Thr Pro Ala Ala Ala Ala Pro 
1235 1240 1245 

His His His Arg His Arg Arg Ala Ala Gly Gly Trp Asp Leu Pro Pro 
1250 1255 " 1260 

Pro Ala Pro Thr Ser Arg Ser Leu Glu Asp Leu Ser Ser Cys Pro Arg 
1265 1270 1275 1280 

Ala Ala Pro Ala Arg Arg Leu Thr Gly Pro Ser Arg His Ala Arg Arg 
1285 * 1290 1295 

Cys Pro His Ala Ala His Trp Gly Pro Pro Leu Pro Thr Ala Ser His 
1300 1305 1310 

Arg Arg His Arg Gly Gly Asp Leu Gly Thr Arg Arg Gly Ser Ala His 
1315 1320 1325 

Phe Ser Ser Leu Glu Ser Glu Val 
1330 1335 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GGGTGGCGGC CGCAGAGCAC CTCCACCATC TCCTTGTCCT ACTCCAAGAT CTGGCCCTAG 60 
TCCATGTTTG C 71 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TGGTGGTCCC CAACCTGTAG GACTTGGTTC TGGAGGAGGA TCTGGTGTAG GCAAACATGG 
ACTAGGGCCA G 



60 
71 
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(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GTTGGGGACC ACCAGATGGA GGTAGAGCTG CACTTGTACG AAGAGCTCCA CAACCACCTG 60 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: both 
(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 



CGTGAGACGT CAGACAAAGG AGGCCCAGGT GTAGGTGGTC TACCAGGTGG TTGTGGAGCT 60 

CT 62 
(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 1 : 

CCGCAGAGCA CCTCCACCAT CTCCTTGTCC TACTCCAAGA TCTGGCCCTA GTCCATGTTT 60 

GCCTACACCA GATCCTCCTC CAGAACCAAG TCCTACAGGT TGGGGACCAC CAGATGGAGG 120 

TAGAGCTGCA CTTGTACGAA GAGCTCCACA ACCACCTGGT AGACCACCTA CACCTGGGCC 180 
TCCTTTGTCT GACGT iq c 



